Compositions and methods for treatment of muscular dystrophy

ABSTRACT

The present invention features recombinant adeno-associated vectors for delivery of genes to both skeletal and cardiac muscle and methods for treatment of muscle defects, including Duchenne&#39;s muscular dystrophy.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to Italian provisional application TO2013A000669, filed on Aug. 5, 2013, the entirety of which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

The present invention relates to compositions including recombinant adeno-associated vectors (AAVs) comprising genes to be expressed in skeletal and cardiac muscle tissue, as well as methods for treatment of muscular defects.

Duchenne's muscular dystrophy (DMD) is a severe X-linked muscle degenerative disease caused by the absence of the cytoskeletal protein dystrophin. The dystrophin protein provides stability to the sarcolemma (i.e., the cell membrane of muscle cells) by linking the intracellular cytoskeletal network to the extracellular matrix. In the absence of dystrophin, muscle contraction mechanically stresses the cell membrane, inducing progressive damage to the myofibers. Initially, skeletal muscles are predominantly affected; however, as the disease progresses, the damage extends to cardiac muscle and death is caused by respiratory or cardiac failure. DMD is one of the most common genetic diseases, affecting an estimated 1 out of every 3,600 male births each year. DMD is a debilitating disease that progressively worsens over the short (approximately 25 year) lifespan of those affected. Unfortunately, despite years of research, there are no curative treatments for muscular dystrophies including DMD. Current therapies are limited to managing symptoms.

Gene transfer by recombinant AAV has gained increasing value in recent years for experimental and human gene therapy. AAVs are small viruses that elicit a very mild immune response and do not cause any known human diseases. Additional advantages of these viral vectors include the existence of several serotypes ensuring a wide spectrum of tissue-specific tropism, as well as a robust, long-term transgene expression in vivo. Many successful preclinical studies on animal models have paved the way for the use of recombinant AAV-based gene delivery for clinical applications. However, the extremely large size of the dystrophin gene (approximately 2,400 kilobases or kb) has complicated efforts for gene therapy to treat DMD or related dystrophinopathies, because the packaging size of inserts into recombinant AAV vectors is limited to about 4.7 kb.

Therefore, a need exists for improved muscle-specific recombinant AAV vectors for delivery of genes to both skeletal and cardiac muscle. There also exists a long-standing need for compositions and methods for treating muscular dystrophies, especially DMD.

SUMMARY OF THE INVENTION

In a first aspect, the invention features a recombinant adeno-associated vector (AAV) for expression of a gene in skeletal or cardiac muscle tissue, the vector including a muscle-specific promoter and a gene encoding a fusion protein, wherein the fusion protein includes a) a transcriptional activation element and b) a DNA binding element, wherein the fusion protein, when expressed in the skeletal or cardiac muscle tissue, is capable of increasing utrophin expression.

In some embodiments of the first aspect, the gene is expressed in skeletal muscle. In some embodiments, the gene is expressed in cardiac muscle. In certain embodiments, the gene is expressed in skeletal and cardiac muscle. In some embodiments, the vector further includes at least one element selected from the group consisting of an inverted terminal repeat, a cap signal, a multicloning site, an intron splice-donor site, an intron splice-acceptor site, an epitope tag, a nuclear localization sequence, and a polyadenylation consensus sequence. In some embodiments, the epitope tag is a myc tag. In some embodiments, the intron splice-acceptor site is a beta-globin intron splice-acceptor site. In some embodiments, the transcriptional activation element is Vp16.

In some embodiments of the first aspect, the DNA binding element binds a mouse or human utrophin promoter. In some embodiments, the mouse or human utrophin promoter includes the sequence of SEQ ID NO:9. In some embodiments, the DNA binding element includes one or more zinc fingers. In certain embodiments, the DNA binding element is selected from the group consisting of Jazz, Bagly, and UtroUp or variants thereof. In some embodiments, the muscle-specific promoter is selected from the group consisting of alpha-actin, cardiac troponin C, myosin light chain 2A, skeletal beta-actin, CK6, dystrophin, and muscular creatine kinase. In some embodiments, the vector has a serotype of AAV8 or AAV9. In some embodiments, the transcriptional activation element is Vp16 and the DNA binding element is Jazz. In some embodiments, the muscle-specific promoter is alpha-actin, the transcriptional activation element is Vp16, and the DNA binding element is Jazz. In some embodiments, the vector includes the sequence of SEQ ID NO:83.

In a second aspect, the invention features a composition including any one of the preceding vectors and a pharmaceutically acceptable carrier.

In a third aspect, the invention features a method of treating Duchenne's Muscular Dystrophy in a subject in need thereof, the method including administering an effective amount of a composition of the second aspect. In some embodiments, the composition is administered intraperitoneally.

In a fourth aspect, the invention features a method of treating Duchenne's Muscular Dystrophy in a subject in need thereof, the method including contacting the utrophin gene of a muscle cell with a fusion protein, wherein the contacting results in an increase in utrophin expression. In some embodiments, the fusion protein binds to a promoter of the utrophin gene. In certain embodiments, the promoter includes a sequence selected from the group consisting of SEQ ID NO: 9, 10, 11, 12, 13, and 14. In some embodiments, the fusion protein includes a sequence selected from the group consisting of SEQ ID NO: 71, 72, and 73.

In a fifth aspect, the invention features a recombinant adeno-associated vector (AAV) for expression of a gene in skeletal or cardiac muscle tissue, the vector including a muscle-specific promoter and a gene encoding a fusion protein, wherein the fusion protein includes a) a transcriptional activation element and b) a DNA binding element, wherein the fusion protein, when expressed in the skeletal or cardiac muscle tissue, is capable of increasing utrophin expression, increasing muscle contractile force, or increasing muscle endurance.

In one embodiment of the fifth aspect, the transcriptional activation element is Vp16 and the DNA binding element is Jazz. In a second embodiment, the muscle-specific promoter is alpha-actin, the transcriptional activation element is Vp16, and the DNA binding element is Jazz. In a third embodiment, the vector includes the sequence of SEQ ID NO:83, 84, or 85.

In a sixth aspect, the invention features a recombinant adeno-associated vector (AAV) for expression of a gene in skeletal or cardiac muscle tissue, the vector including a muscle-specific promoter and a gene encoding a fusion protein, wherein the fusion protein includes: a) a transcriptional activation element and b) a DNA binding element, wherein the fusion protein, when expressed in the skeletal or cardiac muscle tissue, is capable of increasing utrophin expression, increasing muscle contraction force, or increasing muscle endurance, and wherein the vector does not include the sequence of SEQ ID NO:83. In one embodiment of the sixth aspect, the vector includes the sequence of SEQ ID NOs: 84 or 85.

In one embodiment of the fifth and sixth aspects, the gene is expressed in skeletal muscle. In some embodiments, the gene is expressed in cardiac muscle. In a second embodiment, the gene is expressed in skeletal and cardiac muscle. In a third embodiment of the fifth and sixth aspects, the vector further includes at least one element selected from a group consisting of an inverted terminal repeat, a cap signal, a multicloning site, an intron splice-donor site, an intron splice-acceptor site, an epitope tag, a nuclear localization sequence, and a polyadenylation consensus sequence. In certain embodiments, the intron splice-acceptor site is a beta-globin intron splice-acceptor site. In a fourth embodiment of the fifth and sixth aspects, the transcriptional activation element is selected from a group consisting of Vp16, Vp64, p65, SP1, Gal4, and CJ7. In a fifth embodiment of the fifth and sixth aspects, the DNA binding element binds to a promoter of the utrophin gene.

In certain embodiments, the utrophin promoter is a mouse or human utrophin “A” promoter, wherein the mouse or human utrophin “A” promoter includes the sequence of SEQ ID NO:9 or SEQ ID NO:14. In other embodiments, the human utrophin “A” promoter includes the sequence of SEQ ID NO:10 or SEQ ID NO:12. In yet other embodiments, the mouse utrophin “A” promoter includes the sequence of SEQ ID NO:11 or SEQ ID NO:13.

In another embodiment of the fifth and sixth aspects, the DNA binding element includes one or more zinc fingers. In yet another embodiment, the DNA binding element is selected from the group consisting of Jazz, Bagly, or UtroUp, or variants thereof. In yet other embodiments, the DNA binding element is Jazz or variants thereof.

In one embodiment of the fifth and sixth aspects, the muscle-specific promoter is constitutively expressed throughout differentiation. In a second embodiment, the muscle-specific promoter is selected from the group consisting of alpha-actin, cardiac troponin C, myosin light chain 2A, skeletal beta-actin, CK6, dystrophin, muscular creatine kinase, dMCK, tMCK, enh348MCK, synthetic C5-12 (Syn), Myf5, MLC1/3f. MyoD1, Myog, and Pax7. In a third embodiment of the fifth and sixth aspects, the vector has a serotype selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9. In a fourth embodiment, the vector is muscle tropic. In a fifth embodiment, the vector has a serotype of AAV8 or AAV6.

In some embodiments of the fifth and sixth aspects, the muscle endurance is measured in a forced exercise assay, wherein the muscle endurance is increased in the range of about 5% to about 50% relative to baseline. In another embodiment of the fifth and sixth aspects, the utrophin expression is increased by at least 1.5 fold. In yet another embodiment of the fifth and sixth aspects, the muscle contractile force is increased by at least about 10%.

In a seventh aspect, the invention features a composition including any one of the vectors of the fifth and sixth aspects and a pharmaceutically acceptable carrier.

In an eight aspect, the invention features a method of treating a muscle disease in a subject in need thereof, the method including administering an effective amount of a composition of the seventh aspect. In some embodiments, the composition is administered systemically or locally. In some embodiments, the composition is administered intramuscularly, intravenously, subcutaneously, or intraperitoneally.

In a ninth aspect, the invention features a method of treating a muscle disease in a subject in need thereof, the method including contacting the utrophin gene of a muscle cell with a fusion protein, wherein the contacting results in an increase in utrophin expression, an increase in muscle contractile force, or an increase in muscle endurance. In some embodiments, the fusion protein includes a sequence selected from the group consisting of SEQ ID NOs:71, 72, and 73.

In a tenth aspect, the invention features a method of treating a muscle disease in a subject in need thereof, the method including contacting the utrophin gene of a muscle cell with a fusion protein, wherein the contacting results in an increase in utrophin expression, in increase in muscle contractile force, or an increase in muscle endurance, wherein the fusion protein does not include the sequence of SEQ ID NO:71. In some embodiments, the fusion protein includes a sequence selected from the group consisting of SEQ ID NOs:72 and 73.

In one embodiment of the ninth and tenth aspects, the fusion protein binds to a promoter of the utrophin gene. In some embodiments, the promoter is a mouse or human utrophin “A” promoter. In a second embodiment, the promoter includes a sequence selected from the group consisting of SEQ ID NOs: 9, 10, 11, 12, 13, and 14. In a third embodiment, the treating or the contacting results in an increase in muscle contractile force or an increase in muscle endurance. In a fourth embodiment, the muscle disease is a muscular dystrophy, wherein the muscular dystrophy is selected from the group consisting of: Duchenne's Muscular Dystrophy or Becker's Muscular Dystrophy, congenital muscular dystrophy, facioscapulohumeral muscular dystrophy, limb-girdle muscular dystrophy, myotonic muscular dystrophy, and oculopharyngeal muscular dystrophy.

The invention features a modified transcription factor including at least a first, a second, and a third zinc finger motif, wherein the transcription factor is capable of increasing utrophin expression when expressed in skeletal or cardiac muscle tissue. In some embodiments, the transcription factor further includes a fourth, a fifth, a sixth, a seventh, an eighth, or a ninth zinc finger motif. In certain embodiments, each zinc finger motif includes an alpha-helix. In some embodiments, the transcription factor is derived from a genomically-encoded human transcription factor. In some embodiments, the genomically-encoded transcription factor is Zif268.

In one embodiment, the first zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:48, the second zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:49, and the third zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:50.

In a second embodiment, the first zinc finger motif can include SEQ ID NO:48, the second zinc finger motif can include SEQ ID NO:49, and the third zinc finger motif can include SEQ ID NO:50.

In a third embodiment, the transcription factor can include a sequence having at least 95% sequence identity to SEQ ID NO: 38 and wherein the transcription factor includes: i) a first zinc finger motif including an alpha-helix which contains an Arg residue at position −1, a Glu residue at position 3, and an Arg residue at position 6, ii) a second zinc finger motif including an alpha-helix which contains a Ser residue at position −1, an Arg residue at position 1, a Val residue at position 3, an Arg residue at position 5, and Arg residue at position 6, and an Asn residue at position 8, and iii) a third zinc finger motif including an alpha-helix which contains a Ser residue at position −1, an Arg residue at position 1, a Val residue at position 3, a Leu residue at position 4, an Arg residue at position 5, an Arg residue at position 6, an Asn residue at position 8, and an Arg residue at position 9

In a fourth embodiment, the first zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:51, the second zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:52, and the third zinc finger motif can include a sequence having at least 80% sequence identity to SEQ ID NO:53.

In a fifth embodiment, the first zinc finger motif can include SEQ ID NO:51, the second zinc finger motif can include SEQ ID NO:52, and the third zinc finger motif can include SEQ ID NO:53.

In a sixth embodiment, the transcription factor can include a sequence having at least 95% sequence identity to SEQ ID NO: 39 and wherein the transcription factor includes: i) a first zinc finger motif including an alpha-helix which contains an Arg residue at position −1, an Asn residue at position 3, a Val residue at position 5, and an Arg residue at position 6, ii) a second zinc finger motif including an alpha-helix which contains an Arg residue at position −1, a His residue at position 3, and a Thr residue at position 6, and iii) a third zinc finger motif including an alpha-helix which contains a Asp residue at position −1, a Pro residue at position 1, a Gly residue at position 2, a His residue at position 3, a Leu residue at position 4, a Val residue at position 5, an Arg residue at position 6, an Asn residue at position 8, and an Arg residue at position 9.

In some embodiments of the invention, the transcription factor can include an amino acid sequence of SEQ ID NO: 38 or SEQ ID NO: 39. In another embodiment of the invention, the transcription factor is substantially non-immunogenic when expressed in humans. In yet another embodiment, the transcription factor is capable of increasing muscle contractile force, wherein muscle contractile force is measured in an in vivo, ex vivo, or in situ assay. In a further embodiment, the transcription factor is capable of increasing muscle endurance, wherein muscle endurance is measured by a forced exercise assay. In some aspects of this embodiment, the muscle endurance is increased by at least 10% compared to reference.

The invention also features a recombinant adeno-associated vector (AAV) for expression of a gene in skeletal or cardiac muscle tissue, including a muscle-specific promoter and any of the preceding transcription factors. In some embodiments, the gene is expressed in skeletal and cardiac muscle tissue. In some embodiments, the muscle-specific promoter is constitutively expressed throughout differentiation. In certain embodiments, the muscle-specific promoter is selected from the group consisting of alpha-actin, cardiac troponin C, myosin light chain 2A, skeletal beta-actin, CK6, dystrophin, muscular creatine kinase, dMCK, tMCK, enh348MCK, synthetic C5-12 (Syn), Myf5, MLC1/3f, MyoD1, Myog, and Pax7.

In one embodiment, the vector can have a serotype selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9. In some embodiments, the vector is muscle tropic. In certain embodiments, the vector has a serotype of AAV6 or AAV8. In another embodiment, the vector can further include at least one element selected from a group consisting of an inverted terminal repeat, a cap signal, a multicloning site, an intron splice-donor site, an intron splice-acceptor site, an epitope tag, a nuclear localization sequence, and a polyadenylation consensus sequence. In yet another embodiment, the vector includes the sequence of SEQ ID NO:86 or SEQ ID NO:87. In any embodiment of the invention, the transcription factor specifically binds to a promoter of the utrophin gene, wherein the utrophin promoter is a mouse or human utrophin “A” promoter. In certain aspects of this embodiment, the mouse or human utrophin “A” promoter includes the sequence of SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, or SEQ ID NO:40.

The invention also features a composition including any one of the preceding vectors and a pharmaceutically acceptable carrier. The invention further features a method of treating a muscle disease in a subject in need thereof, the method including administering an effective amount of the preceding composition. The invention also features a method of treating a muscle disease, the method including contacting the utrophin gene of a muscle cell with any one of the preceding modified transcription factors. In some embodiments, the contacting results in an increase in muscle contractile force or an increase in muscle endurance.

In some embodiments, the composition is administered systemically or locally. In other embodiments, the composition is administered intramuscularly, intravenously, subcutaneously, or intraperitoneally. In any of the preceding methods of the invention, the treating can result in an increase in muscle contractile force or an increase in muscle endurance. In any aspects of the invention, the muscle disease can be a muscular dystrophy, wherein the muscular dystrophy is selected from the group consisting of: Duchenne's Muscular Dystrophy or Becker's Muscular Dystrophy, congenital muscular dystrophy, facioscapulohumeral muscular dystrophy, limb-girdle muscular dystrophy, myotonic muscular dystrophy, and oculopharyngeal muscular dystrophy.

DEFINITIONS

The term “about” is used herein to mean a value that is ±10% of the recited value.

As used herein, by “administering” is meant a method of giving a dosage of a composition described herein (e.g., a composition comprising a recombinant AAV vector comprising a polynucleotide encoding a fusion protein or modified transcription factor capable of increasing utrophin expression) to a subject. The compositions utilized in the methods described herein can be administered by a route selected from, e.g., parenteral (for example, intravenous or intraperitoneal), dermal, transdermal, ocular, inhalation, buccal, sublingual, perilingual, nasal, rectal, topical, and oral. The compositions utilized in the methods described herein can also be administered locally or systemically. The preferred method of administration can vary depending on various factors (e.g., the components of the composition being administered and the severity of the condition being treated).

“Amino acid sequence” as used herein, refers to an oligopeptide, peptide, polypeptide, or protein sequence, and fragments or portions thereof, and to naturally occurring or synthetic molecules.

By “cardiac muscle” is meant the form of striated muscle tissue that is found in the heart which is under the control of the autonomic nervous system, i.e., it is involuntarily controlled. The cells that constitute cardiac muscle are called cardiomyocytes or myocardiocytes. The heart is an organ that is composed mostly of cardiac muscle and connective tissue.

By “modified transcription factor” is meant a transcription factor that is substantially derived from a genomically-encoded transcription factor. Modified transcription factors can be derived from transcription factors known in the art, for example, from zinc finger transcription factors. The zinc finger motifs of a transcription factor (e.g., a human transcription factor) can be replaced, modified, or engineered to change the target sequence that the transcription factor specifically recognizes and binds. Modified transcription factors derived from human transcription factors are referred to as “modified human transcription factors” herein. One exemplary human zinc finger transcription factor is Zif268, also known as “Early growth response protein 1,” “Egr1,” or “EGR-1”. The protein encoded by Zif268 belongs to the EGR family of C₂H₂-type three-zinc-fingers containing proteins. Zif268 is a nuclear protein that functions as a transcriptional regulator. The principal isoform of human Zif268 protein contains 543 amino acids with a molecular weight of 57.5 kDa and binds the DNA target sequence: 5′-GCGTGGGCG-3′ (SEQ ID NO: 44). An exemplary amino acid sequence of human Zif268 is shown in SEQ ID NO:41. An exemplary mRNA sequence of human Zif268 can be found, e.g., under NCBI Accession Number NM_001964 or in SEQ ID NO:19.

As used herein, “conservative variations” or “conservative modified variations” of a particular sequence refers to amino acids encoded by nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids may encode any given peptide. Such nucleic acid variations are silent variations, which are one species of conservatively modified variations. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be modified to yield a functionally identical molecule by standard techniques. Accordingly, each silent variation of a nucleic acid which encodes a peptide is implicit in any described amino acid sequence. Furthermore, one of skill will recognize that individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids in an encoded sequence are conservatively modified variations where the alterations result in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. The following six groups each contain amino acids that are conservative substitutions for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

By “contact” or “contacting” as used herein is meant to allow or promote a state of immediate proximity or association between at least two elements. For example, the utrophin gene can be contacted with a fusion protein or modified transcription factor of the invention that specifically binds a target sequence in the utrophin “A” promoter, thereby increasing expression of utrophin.

A “deletion”, as used herein, refers to a change in either amino acid or nucleotide sequence in which one or more amino acid or nucleotide residues, respectively, are absent.

By “DNA binding element” is meant an element capable of binding DNA. A DNA binding element may bind specifically to a target sequence or response element of a gene, e.g., in a promoter. Exemplary DNA binding elements include but are not limited to: zinc finger domains (e.g., domains that include zinc finger motifs), helix-turn-helix motifs, leucine zipper domains, winged helix domains, winged helix turn helix domains, helix-loop-helix domains, HMG box domains, Wor3 domains, immunoglobulin domains, B3 domains, TAL effector DNA-binding domains, RNA-guided DNA-binding domains, ZFP51, where the DNA target sequence of ZFP51 is represented by SEQ ID NO:15, and any described herein (e.g., Jazz, UtroUp, Bagly, or the zinc finger motifs of JZif1 or JZif2, or an element having the sequence of SEQ ID NO:16-18 or SEQ ID NO:42,43, or having at least 70% (e.g., 72%, 75%, 80%, 81%, 85%, 90%, 95%, 99%) identity to that of SEQ ID NO:16-18 or SEQ ID NO:42,43 or any of the zinc finger motifs described in Tables 2 and 6.

By “an effective amount of a composition” is meant the amount of a composition administered to improve, inhibit, or ameliorate a condition of a subject, or a symptom of a disorder or disease, e.g., a muscle disease, in a clinically relevant manner (e.g., improve, or ameliorate DMD, or inhibit degeneration of muscle tissue in DMD). Any improvement in the subject is considered sufficient to achieve treatment. Preferably, an amount sufficient to treat is an amount that reduces, inhibits, or prevents the occurrence or one or more symptoms of DMD or is an amount that reduces the severity of, or the length of time during which a subject suffers from, one or more symptoms of DMD (e.g., by at least 10%, 20%, or 30%, more preferably by at least 50%, 60%, or 70%, and most preferably by at least 80%, 90%, 95%, 99%, or more, relative to a control subject that is not treated with a composition of the invention). An effective amount of the pharmaceutical composition used to practice the methods described herein (e.g., the treatment of DMD) varies depending upon the manner of administration and the age, body weight, and general health of the subject being treated. A physician or researcher can decide the appropriate amount and dosage regimen.

As used herein, “expression vectors” or “expression plasmid”, and similar terms, are defined herein as DNA sequences that are required for the transcription of cloned copies of genes and/or and the translation of their mRNAs in an appropriate host. Such vectors can be used to express viral, prokaryotic, or eukaryotic genes in a variety of hosts including, but not limited to, bacteria, for example, E. coli, blue-green algae, plant cells, insect cells, fungal cells including yeast cells, and animal cells.

As used herein, the term “fusion protein” includes a compound containing all or a portion of the amino acid sequences of two or more proteins. The modified transcription factors (e.g., modified human transcription factors) described herein are excluded from the definition of fusion proteins. For example, a fusion protein can comprise a DNA binding element (e.g., Jazz, Bagly, or UtroUp) and one or more transcriptional activation elements (e.g., Vp16, CJ7, SP1, or Gal4). The terms “protein” and “polypeptide” are used interchangeably herein. The term “portion” includes any region of a polypeptide, such as a fragment (e.g., a cleavage product or a recombinantly-produced fragment) or an element or a domain (a region of a polypeptide having an activity, such as, e.g., enzymatic activity or antigen- or DNA-binding capacity) that contains fewer amino acids than the full-length polypeptide (e.g., 5%, 10%, 12%, 15%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% fewer). A fusion protein may include one or more linkers between the amino acid sequences of the proteins. Those skilled in the art will recognize that the terms “portion” and “fragment” are also used interchangeably herein.

By “host,” “subject,” or “patient,” is meant any organism, such as a mammal (e.g., a human, a primate, dog, cat, cow, horse, pig, goat, rat, and mouse), a fish, or a bird. A host may also be a domestic animal (e.g., a farm animal) or a companion animal (e.g., a pet).

By “immunogenicity” is meant the ability of a particular substance (including, e.g., an antigen, an epitope, or a protein) to provoke an immune response in the body of a human or animal. An immune response may include a humoral or cell-mediated immune response. Immunogenicity is typically undesirable during treatment with therapeutic proteins, including artificial transcription factors of the invention, because some patients have or develop antibodies that bind and/or neutralize the therapeutic protein, leading to inactivation of the therapeutic protein and in some cases adverse effects. Factors including delivery route, delivery vehicle, dose regiment, aggregation, innate immune system activation, molecular size, epitope density, phylogenetic distance, protein structure, degradability, and the ability of the protein to interface with humoral (B cell) and cellular (T cell) responses can all affect the level of immunogenicity.

By “increasing muscle contractile force” is meant increasing the force that can be exerted by contraction of a muscle by at least at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, or by 3-fold, 4-fold, 5-fold, or more compared to reference. The force generated by a muscle can be measured in vivo, in situ, or ex vivo, for example by ex vivo or in situ analysis of the contractile profile of a single intact limb muscle (e.g. the extensor digitorum longus or abdominal muscles for ex vivo assay and the tibialis anterior muscle for in situ assay), grip force analysis, downhill treadmill exercise, manual muscle testing, myometry (e.g. assessing upper and lower extremity strength using a myometer, including evaluation of knee flexors and extensors, elbow flexors and extensors, and shoulder abductors). sustained maximum voluntary contraction (MVC) assays or in any of the assays described in Hakim et al., Methods Mol. Biol. 709: 75-89, 2011; Sharma et al., Neurology 45: 306-310, 1995; and McDonald et al., Muscle Nerve 48: 343-356, 2013.

By “increasing muscle endurance” is meant increasing the ability of a muscle or group of muscles to sustain repeated contractions (e.g., an increase of at least 1%, 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, or more compared to reference) over an extended period of time (e.g., a period of time in hours (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 hours), days (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 days), weeks (e.g. 1, 2, 3, 4, 5, 6, or 7 weeks), or months (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 months)). Muscle endurance may be measured by assays including but not limited to treadmill exercise, 6-minute walk test (6MWT), timed function tests, (e.g., time taken to stand from a supine position, time taken to run/walk 10 m, or time taken to climb/descend 4 standard-sized stairs), by any of the tests suitable for testing muscular strength or endurance in mice including but not limited to enforced treadmill exercise, either at constant speed (e.g., any assay described in Radley-Crabb et al., Neuromuscul. Disord. 22(2):170-182, 2012) or at accelerated speed (see, e.g., Di Certo et al., Hum. Mol. Genet. 19:752-760, 2010 or Strimpakos et al., J. Cell. Phys. 229:1283-1291, 2014), voluntary wheel exercise, grip strength test, the hang wire test, the inverted grid test, and the rotarod test, or by any of the assays described in McDonald et al., Muscle Nerve 48: 343-356, 2013.

By “increasing utrophin expression” is meant to increase utrophin gene transcription, protein expression, and/or protein activity as compared to a normal or positive reference cell or tissue (e.g., an increase of at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold or more compared to a reference cell or tissue). The expression product can be an RNA transcribed from the gene (e.g. an mRNA) or a polypeptide translated from an mRNA transcribed from the gene. Typically, an increase in the level of an mRNA results in an increase in the level of a polypeptide translated therefrom. The level of expression and/or activity may be determined using standard techniques for detecting and measuring mRNA or protein, including but not limited to RT-PCR, Western blotting, enzyme-linked immunosorbant assay (ELISA), immunohistochemistry, immunofluorescence, and mass spectrometry.

An “insertion” or “addition,” as used herein, refers to a change in an amino acid or nucleotide sequence resulting in the addition of one or more amino acid or nucleotide residues, respectively, as compared to the naturally occurring molecule.

The terms “linker,” “linker region,” or “linker domain,” or similar, such descriptive terms as used herein refers to elements that are located between adjoined polynucleotide or polypeptide sequences. For example, stretches of polynucleotide or polypeptide sequence that are used in can be introduced during the construction of a cloning vector or fusion protein. Linkers can also be introduced by chemical conjugation, e.g., by ‘click chemistry’-based approaches. Functions of a linker region can include introduction of cloning sites into the nucleotide sequence, introduction of a flexible component or space-creating region between two protein domains, or creation of an affinity tag for specific molecular interaction. A linker region may be introduced into a fusion protein as a product of the recombinant nucleic acid production.

By “muscle-specific promoter” is meant any sequence of DNA that can promote expression of a gene preferentially in muscle tissue compared to a variety of other tissue types. Typically, a muscle-specific promoter is present upstream of the transcription initiation site of a gene. For example, a muscle-specific promoter may increase expression of a gene in a muscle at least 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 15-fold, 20-fold, 30-fold, 50-fold, 100-fold or more compared to a reference non-muscle tissue. Some muscle-specific promoters are constitutively expressed throughout differentiation. Exemplary muscle-specific promoters include but are not limited to: alpha-actin, cardiac troponin C, myosin light chain 2A, skeletal beta-actin, CK6, dystrophin, muscular creatine kinase, dMCK, tMCK, enh348MCK, synthetic C5-12 (Syn), Myf5, MLC1/3f, MyoD1, Myog, and Pax7. See, for example, U.S. Patent Application 2011/0212529, McCarthy et al., Skeletal Muscle 2:8, 2012; and Wang et al., Gene Ther. 15:1489-1499, 2008, the entirety of which are incorporated herein by reference.

By “muscular dystrophy” is meant the group of muscle diseases that weaken the musculoskeletal system and hamper locomotion. Muscular dystrophies are characterized by progressive deterioration of muscle function (e.g., weakness), defects in muscle proteins, and death of muscle cells and tissue. Some types of muscular dystrophy are characterized as dystrophinopathy, which includes a spectrum of muscle diseases in which there is insufficient dystrophin protein produced in the muscle cells, resulting in instability in the structure of the muscle cell membrane. Non-limiting examples of dystrophinopathies include Duchenne's muscular dystrophy (DMD) and Becker's muscular dystrophy (BMD, also known as Benign pseudohypertrophic muscular dystropy). DMD and BMD are X-linked recessive diseases caused by mutations in the dystrophin gene, which encodes the protein dystrophin. DMD is more severe than BMD because typically no dystrophin protein is produced in the affected muscle cells in DMD, whereas in BMD, defective dystrophin is produced. Other examples of muscular dystrophies include but are not limited to: congenital muscular dystrophy, facioscapulohumeral muscular dystrophy, limb-girdle muscular dystrophy, myotonic muscular dystrophy, and oculopharyngeal muscular dystrophy.

“Nucleic acid sequence” as used herein, refers to an oligonucleotide, nucleotide, or polynucleotide, and fragments or portions thereof, and to DNA or RNA of genomic or synthetic origin which may be single- or double-stranded, and represent the sense or antisense strand.

As used herein, the term “operable linkage” or “operably linked” refers to a physical or functional juxtaposition of the components so described as to permit them to function in their intended manner. More specifically, for example, two DNA sequences operably linked means that the two DNAs are arranged (cis or trans) in such a relationship that at least one of the DNA sequences is able to exert a physiological effect upon the other sequence. For example, a muscle specific promoter can be operably linked with a gene to promote muscle-specific transcription of the gene.

By “pharmaceutical composition” is meant any composition that contains a therapeutically or biologically active agent (e.g., at least one nucleic acid molecule that encodes all or part of a fusion protein comprising a transcriptional activation element and a DNA binding element either incorporated into a viral vector or independent of a viral vector (e.g., incorporated into a liposome, microparticle, or nanoparticle)) that is suitable for administration to a subject. The compositions utilized in the methods described herein can be administered by a route selected from, e.g., parenteral (e.g., intravenous or intraperitoneal), dermal, transdermal, ocular, inhalation, buccal, sublingual, perilingual, nasal, rectal, topical, and oral. The compositions utilized in the methods described herein can also be administered locally or systemically. The term “parenteral” as used herein refers to subcutaneous, intracutaneous, intravenous, intraperitoneal, intramuscular, intraarticular, intraarterial, intrasynovial, intrasternal, intrathecal, intralesional, or intracranial injection, as well as any suitable infusion technique. A sterile injectable composition can be a solution or suspension in a non-toxic parenterally acceptable diluent or solvent. Such solutions include, but are not limited to, 1,3-butanediol, mannitol, water, Ringer's solution, and isotonic sodium chloride solution. In addition, fixed oils are conventionally employed as a solvent or suspending medium (e.g., synthetic mono- or diglycerides). Fatty acid, such as, but not limited to, oleic acid and its glyceride derivatives, are useful in the preparation of injectables, as are natural pharmaceutically acceptable oils, such as, but not limited to, olive oil or castor oil, polyoxyethylated versions thereof. These oil solutions or suspensions also can contain a long chain alcohol diluent or dispersant such as, but not limited to, carboxymethyl cellulose, or similar dispersing agents. Other commonly used surfactants, such as, but not limited to, Tweens or Spans or other similar emulsifying agents or bioavailability enhancers, which are commonly used in the manufacture of pharmaceutically acceptable solid, liquid, or other dosage forms also can be used for the purpose of formulation. Any of these formulations can be prepared by well-known and accepted methods of art. See, for example, Remington: The Science and Practice of Pharmacy (21^(st) ed.), ed. A. R. Gennaro, Lippincott Williams & Wilkins, 2005, and Encyclopedia of Pharmaceutical Technology, ed. J. Swarbrick, Informa Healthcare, 2006, each of which is hereby incorporated by reference.

By “pharmaceutically acceptable diluent, excipient, carrier, or adjuvant” is meant a diluent, excipient, carrier, or adjuvant which is physiologically acceptable to the subject while retaining the therapeutic properties of the pharmaceutical composition with which it is administered. For injection, formulations can be prepared in conventional forms as liquid solutions or suspensions or as solid forms suitable for solution or suspension in liquid prior to injection or as emulsions. Acceptable carriers, excipients, or stabilizers for intravenous administration are nontoxic to recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; hexamethonium chloride; benzalkonium chloride, benzethonium chloride; phenol, butyl or benzyl alcohol; alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m-cresol); low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, histidine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugars such as sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions such as sodium; metal complexes (e.g., Zn-protein complexes); and/or non-ionic surfactants such as TWEEN™, PLURONICS™, or polyethylene glycol (PEG). Suitable excipients include, for example, water, saline, dextrose, glycerol and the like. Such compositions may also contain amounts of nontoxic auxiliary substances such as wetting or emulsifying agents, pH buffering agents and the like, such as, for example, sodium acetate, sorbitan monolaurate, and so forth. Other physiologically acceptable diluents, excipients, carriers, or adjuvants and their formulations are known to one skilled in the art.

By “recombinant adeno-associated vector (AAV)” is meant a recombinantly produced virus or viral particle that comprises a polynucleotide to be delivered into a host cell, either in vivo, ex vivo or in vitro. AAV is a nonpathogenic human parvovirus which is commonly used for gene transfer in mammals. The AAV genome is built of single stranded DNA, and comprises inverted terminal repeats (ITRs) at both ends of the DNA strand, and two open reading frames: rep and cap, encoding replication and capsid proteins, respectively. A foreign polynucleotide can replace the native rep and cap genes. AAVs can be made with a variety of different serotype capsids which have varying transduction profiles or as used herein “tropism” for different tissue types. Examples of AAV serotypes include but are not limited to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, and AAVrh10. AAV vectors can be produced, for example, by triple transfection of subconfluent HEK293 cells by three plasmids: AAV cis-plasmid containing the gene of interest, AAV trans-plasmid containing AAV rep and cap genes, and an adenovirus helper plasmid, for example, pDF6. For example, the expression vectors described by SEQ ID NOs:1 or 2 or SEQ ID NOs:62-65 can be in the production of recombinant adeno-associated vectors (e.g., viral particles that comprise nucleotide sequences that allow for muscle-specific expression of artificial transcription factors of the invention). Viral particles are described herein by the expression vector used for their production and by the serotype. For example, the viral particle mAAV8-Vp16-Jazz have a capsid serotype 8 and are produced using the mAAV-Vp16-Jazz expression vector.

By “reference” is meant any sample, standard, or level that is used for comparison purposes. A “normal reference sample” can be a sample taken from the same subject prior to the onset of a disorder (e.g., a muscular dystrophy), a sample from a subject not having the disease or disorder, a subject that has been successfully treated for the disease or disorder, or a sample of a purified reference polypeptide at a known normal concentration. By “reference standard or level” is meant a value or number derived from a reference sample. A normal reference standard or level can be a value or number derived from a normal subject that is matched to a sample of a subject by at least one of the following criteria: age, weight, disease stage, and overall health. A “positive reference” sample, standard, or value is a sample, standard, value, or number derived from a subject that is known to have a disorder (e.g., a muscular dystrophy) that is matched to a sample of a subject by at least one of the following criteria: age, weight, disease stage, and overall health.

By “skeletal muscle” is meant the form of striated muscle tissue which is under the control of the somatic nervous system, i.e., it is voluntarily controlled. The term muscle refers to multiple bundles of muscle fibers held together by connective tissue. Skeletal muscles may be attached to bones by tendons. Skeletal muscles may be attached to bones by tendons. Non-limiting examples of skeletal muscles include, for example, the diaphragm, extensor digitorum longus, tibialis anterior, gastrocnemius, soleus, plantaris, biceps, triceps, deltoids, pectoralis major, pectoralis minor, rhomboids, trapezius, sartorius, knee flexors and extensors, elbow flexors and extensors, shoulder abductors, and abdominal muscles.

By “specifically binds” or “binds” is meant a molecule (e.g., an artificial transcription factor) which recognizes and binds another molecule (e.g., a polynucleotide), but that does not substantially recognize and bind other molecules. In one example, an artificial transcription factor specifically binds a DNA sequence in the utrophin “A” promoter but not other polynucleotide sequences. The binding affinity of a molecule X for its partner Y can generally be represented by the dissociation constant (Kd). The term “specific binding,” “specifically binds to,” or is “specific for” a particular molecule (e.g., a polynucleotide or a polypeptide), as used herein, can be exhibited, for example, by a molecule having a K_(d) for the molecule to which it binds of at least about 10⁻³, 10⁻⁴ M, 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷ M, 10⁻⁸ M, 10⁻⁹ M, 10⁻¹⁰ M, 10⁻¹¹ M, 10⁻¹² M, or greater. “Binding affinity” generally refers to the strength of the sum total of noncovalent interactions between a single binding site of a molecule (e.g., a fusion protein) and its binding partner (e.g., a polynucleotide sequence).

The term “substantially the same” when used herein with respect to the comparison of a sequence to a reference sequence is applicable to sequences that have at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 98%, and at least about 99% sequence identity to the reference sequence. The determination of percent identity between two sequences can be determined using standard methods and algorithms including, e.g., BLASTN (NCBI; Schaffer et al., Nucleic Acids Res. 29:2994-3005, 2001), BLASTX (NCBI; Schaffer et al., Nucleic Acids Res. 29:2994-3005, 2001), ALIGN (GCG, Accelrys), and FASTA (Pearson et al., Proc. Natl. Acad. Sci. U.S.A. 85:2444-2448, 1988) programs, which may employ default settings. In various specific examples, as described herein, amino acid sequences of the invention include those having, e.g., 5, 7, 10, 20, 30, 40, 50, 75, or 100 consecutive amino acids that are 100% identical to the reference sequences.

A “substitution”, as used herein, refers to the replacement of one or more amino acids or nucleotides by different amino acids or nucleotides, respectively.

By “transcriptional activation element” is meant a polypeptide that is capable of inducing transcription, i.e., the process of making an RNA transcript (e.g., an mRNA) from a DNA template by RNA polymerase. For example, a transcriptional activation element may comprise a trans-activation domain of a transcription factor protein or other domains that can directly or indirectly promote transcription. Transcriptional activation elements can also include polypeptides or fragments thereof that can recruit components of the transcriptional machinery, which includes transcriptional coregulators, transcriptional coactivators (e.g., TAF9, MED15, Gcn5, and CBP/p300), chromatin remodelers (e.g., histone acetylases, histone de-acetylases, or histone methylases), kinases, or DNA methylases, in order to promote transcription. Transcriptional activation elements may function by recruiting proteins that promote transcription or by removing transcriptional inhibitors. A transcriptional activation element typically requires an accompanying DNA binding element to bring it into proximity to DNA. Exemplary transcriptional activation elements include but are not limited to: acidic or hydrophobic activation domains (e.g., from Gal4 or Gcn4, respectively), nine-amino-acid transactivation domains (9aaTAD, e.g., from p53, Vp16, MLL, E2A, HSF1, NF-IL6, and NF-κB), Vp64, p65, SP1, Zif268, and the trans-activation domain CJ7 derived from human Che-1/AATF (see, e.g., Onori et al., BMC Mol. Biol. 14:3, 2013). See, for example, US Patent Application Publication No. 2007/0020627and Blancafort et al., Mol. Pharmacol. 66(8): 1361-137, 2004. Transcriptional activation elements can also include or be derived from physiological regulators of utrophin expression, including NFAT, GABPα, and GABPβ. An exemplary amino acid sequence of Vp16 is shown in SEQ ID NO:37, and an exemplary amino acid sequence of CJ7 is shown in SEQ ID NO:75.

By “transcription factor” is meant a protein that binds to specific DNA sequences, thereby controlling transcription of DNA into messenger RNA (mRNA). Transcription factors can perform this activity alone or with other proteins in a complex by promoting or blocking the recruitment of RNA polymerase. By “artificial transcription factor” is meant any transcription factor that does not occur in nature. An artificial transcription factor as described herein can be a fusion protein or a modified transcription factor.

By “treating” is meant administering a pharmaceutical composition of the invention for prophylactic and/or therapeutic purposes. Prophylactic treatment may be administered, for example, to a subject who is not yet ill, but who is susceptible to, or otherwise at risk of, a particular muscle disease or defect, e.g., DMD (e.g., the subject may have mutations that cause DMD but is still young and hence, asymptomatic or the status of mutations that cause DMD is unknown. Therapeutic treatment may be administered, for example, to a subject already suffering from DMD in order to improve or stabilize the subject's condition (e.g., a patient already presenting symptoms of DMD). Thus, in the claims and embodiments described herein, treating is the administration to a subject either for therapeutic or prophylactic purposes. In some instances, as compared with an equivalent untreated control, treatment may ameliorate a disorder (e.g., DMD) or a symptom of the disorder, or reduce the progression, severity, or frequency of one or more symptoms of the disorder by, e.g., 1%, 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 100% as measured by any standard technique. For example, for measuring symptoms of DMD, one may use, e.g., electromyography (EMG), genetic tests, muscle biopsy, serum Creatine Kinase (CK) levels, muscular strength tests (e.g., manual muscle testing), or range-of-motion (ROM) tests such as the six minute walk test. Symptoms of muscular dystrophy including DMD, which may vary from mild to severe and may depend on what part of the body is affected, the causative mutation, and the age and overall health of the affected person, include, e.g., fatigue, learning difficulties, intellectual disability, muscle weakness (e.g., in the legs, pelvis, arms, neck, diaphragm, heart, or other areas of the body), difficulty with motor skills (e.g., running, hopping, or jumping), frequent falls, trouble getting up from a lying position or climbing stairs, progressive difficulty walking, breathing difficulties, heart disease, abnormal heart muscle (e.g., cardiomyopathy), congestive heart failure, irregular heart rhythm (e.g., arrhythmias), deformities of the chest or back (scoliosis), enlarged muscles of the calves, buttocks, or shoulders, pseudohypertrophy, muscle deformities, respiratory disorders (e.g., pneumonia or poor swallowing). Detecting an improvement in, or the absence of, one or more symptoms of muscular dystrophy, indicates successful treatment.

By “utrophin “A” promoter” or “utrophin promoter “A”” is meant the promoter region located at the 5′ upstream region of the utrophin gene. The utrophin “A” promoter lies in an unmethylated CpG island and is active in muscle cells. This is in contrast to the utrophin “B” promoter which is immediately upstream of the large second exon of utrophin and is active in endothelial cells. An exemplary sequence of the human utrophin “A” promoter is defined by SEQ ID NO:20 or can include the sequences having SEQ ID NOs:9,10,12, or 14. An exemplary sequence of the mouse utrophin “A” promoter is defined by SEQ ID NO:33 or can include the sequences having SEQ ID NOs:11,13, or 15.

By “zinc finger motif” is meant a type of protein structural motif. Typically, zinc finger motifs coordinate one or more zinc ions in order to stabilize the fold. A zinc finger DNA binding domain may comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, or more zinc finger motifs arranged in a tandem array which can bind in the major groove of DNA. Cys₂.His₂ type zinc fingers are a well-known example of a zinc finger motif that consists of approximately 28-31 amino acids that fold into a ββα structure. The Cys₂.His₂ type zinc fingers motif typically has a sequence of the form X₃-Cys-X₂₋₄-Cys-X₁₂-His-X₃₋₅-His-X₄, wherein X is any amino acid (e.g. X₂₋₄ indicates an oligopeptide 2-4 amino acids in length). There is generally a wide range of sequence variation in the 28-31 amino acids of the known zinc finger polypeptide. Only the two consensus histidine residues and two consensus cysteine residues bound to the central zinc atom are invariant. Of the remaining residues, three to five are highly conserved, while there is significant variation among the other residues. The alpha-helix of each motif (often called the “recognition helix”) can make sequence-specific contacts to DNA bases; residues from a single recognition helix typically contact 3 base pairs of DNA. Changes in the key amino acid positions (e.g., positions −1, +3, and +6) of the zinc finger alpha-helix can modify the DNA binding target specificity of a zinc finger motif. See, for example, Corbi et al., Biochem. Cell. Biol. 82:428-436, 2004; Klug, Q. Rev. Biophys. 43:1-21, 2010; Choo et al., Curr. Opin. Struct. Biol. 7:117-125, 1997; Pabo et al., Annu. Rev. Biochem. 70:313-340, 2001; and Segal et al., Curr. Opin. Biotechnol. 12:632-637, 2001; and Sera, Adv. Drug Deliv. Rev. 61:513-526, 2009. Lists of human Cys₂-His₂ type zinc finger proteins can be found, e.g., from the HUGO Gene Nomenclature Committee (HGNC). Cys2-His2 type zinc finger proteins often contain an effector domain located N-terminally to the zinc finger region, such as the KRAB (Kruppel-Associated-Box), SCAN (SRE-ZBP, CTfin51, AW-1 and Number18 cDNA) and BTB (Broad-Complex, Tramtrack and Bric-a-bric) effector domains. Other exemplary zinc finger motifs include Gag knuckle, Treble clef, zinc ribbon, Zn₂/Cys₆, and TAZ2-domain like zinc finger motifs.

Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A-1D show construction and validation of mAAV vectors. (A) Schematic representation of the engineered mAAV-EGFP and mAAV-Vp16-Jazz vectors under the control of the muscle-specific human alpha-actin promoter. (B) Time course titration curve obtained by qPCRs performed with growth medium fractions containing viral particles from AAV-293 transfected cells. (C) top: EGFP expression in skeletal muscles (abdomen, diaphragm and quadriceps) and cardiac muscle. mAAV8-EGFP-treated and untreated (Ctr) mdx mice were injected intraperitoneally at 5 days of age. Injections were performed with 150 μl of mAAV8-EGFP virus suspension at the concentration of 5×10¹²v.p./ml, or with the same volume of saline solution. At different times after injection mice were analyzed and cryostatic sections were examined for direct EGFP fluorescence. Nuclei were stained with DAPI. All the images were taken at 10× magnification. Bottom left: Graph showing the percentage of EGFP-positive fibers in the indicated muscular tissue at 15 days and 2 months of age. Bottom right: absence of fluorescence signal in the liver from mAAV8-EGFP-infected mdx mice both at 15 days and 2 months of age confirms the muscle-specific expression of the mAAV8-EGFP vector. (D) Evaluation of EGFP protein expression by Western blot analysis in abdominal and diaphragm skeletal muscle, cardiac muscle and liver tissues. Five-day-old mdx mice were injected as above and examined 15 days after injection using the polyclonal antibody against EGFP. Detection of α-tubulin was used to normalize the amount of proteins. Vp16-Jazz is abbreviated as “Jazz” in FIGS. 1A-1D.

FIGS. 2A-2E show expression of Vp16-Jazz and utrophin up-regulation in mdx mice after intraperitoneal infection at 5 days of age with 150 μl of mAAV8-Vp16-Jazz virus suspension at the concentration of 5×10¹² v.p./ml, or with the same volume of saline solution. (A) Evaluation of Vp16-Jazz-mRNA expression by RT-PCR of abdominal and quadriceps skeletal muscle mRNA from the mAAV8-Vp16-Jazz-treated and untreated mdx mice, as compared to a 32-microglobulin (b2M) mRNA control from the same samples, shown below. (B) Evaluation of Vp16-Jazz protein expression by Western blot analysis, using the anti-myc tag monoclonal 9E10 antibody, in skeletal (abdomen and quadriceps) muscles, cardiac muscles, and in liver tissues, 15 days post injection. Detection of α-tubulin was used to normalize the amount of proteins. (C) Immunohistochemistry of the quadriceps muscle derived from 2-month-old untreated and mAAV8-Vp16-Jazz-treated mdx mice stained with the anti-myc tag monoclonal 9E10 antibody (red). The extracellular matrix is stained with the anti-laminin polyclonal antibody (green), and nuclei are counterstained with DAPI. Scale bar: 20 μm. (D) Quantification by real-time PCR of utrophin transcripts from skeletal (abdomen and quadriceps) and heart muscles isolated from 2-month-old untreated mdx mice and mAAV8-EGFP-treated or mAAV8-Vp16-Jazz-treated mdx mice. The gene expression ratio between utrophin and β2-microglobulin (132M) is shown as the mean±SEM. from three independent experiments performed in triplicate. *P<0.05 and **P<0.01 indicate statistical significance by t-test. (E) Western blot analysis of utrophin protein levels in abdominal, heart, and quadriceps muscles isolated from 2-month-old mAAV8-Vp16-Jazz-treated or mAAV8-EGFP-treated mdx mice. Representative individual mice are indicated with numbers. Detection of laminin was used to normalize the amount of proteins. Vp16-Jazz is abbreviated as “Jazz” in FIGS. 2A-2E.

FIGS. 3A-3C show that Vp16-Jazz ameliorates mdx muscle morphology and histopathology. (A) Hematoxylin and eosin (H&E) staining of the quadriceps muscle from 2-month-old untreated, mAAV8-EGFP-treated, and mAAV8-Vp16-Jazz-treated mdx mice. Reduction in degeneration, necrotic foci, and inflammatory cells is observed in mAAV8-Vp16-Jazz-treated myofibers (representative sections out of six mice examined in each group). (B) Quantification of central nucleation (CNF), cross-sectional area (CSA), and inflammatory infiltrates on H&E stained sections of quadriceps muscle from 2-month-old untreated, mAAV8-EGFP-treated, and mAAV8-Vp16-Jazz-treated mdx mice (six mice were analyzed in each group; the number of CNFs was obtained by normalizing to the number of total myofibers per CSA, and at least 200 myofibers per section were counted). (C) Immunohistochemistry of the quadriceps muscle isolated from 2-month-old untreated, mAAV8-Vp16-Jazz-treated, and mAAV-EGFP-treated mdx mice (four mice were analyzed in each group). Quantification of macrophage infiltration was performed by staining with CD68 monoclonal antibody (light shading). The extracellular matrix is counterstained with the anti-laminin polyclonal antibody (reticulated staining). Nuclei are stained with DAPI (not visible in grayscale); see Example 1: Materials and Methods section). Right: Graph shows the quantification of the CD68-positive area (four mice were analyzed in each group). All values are expressed as the mean±SEM. *P<0.05 and ***P<0.001 indicate statistical significance by t-test. Vp16-Jazz is abbreviated as “Jazz” in FIGS. 3A-3E.

FIGS. 4A-4D show rescue of muscle function in dystrophic mdx mice by mAAV8-Vp16-Jazz treatment. (A) Mechanical response of isolated muscles. Effects of changes in voltage on isotonic contractions in EDL and abdominal muscles, from control and mAAV8-Jazz-treated mdx mice. Muscle contractile force for each voltage was determined and considered as a percentage of maximal contraction. Each point represents the mean of 10 extensor digitorum longus (EDL) muscles (5 animals) and five abdominal muscle strips, with error bars indicating SEM and **P<0.01 indicating the statistical significance by t-test. (B) Procion Orange dye uptake in sections of abdominal and EDL muscles after force test. Left: Representative images demonstrating the increased ability of mAAV8-Vp16-Jazz-treated mdx mice to exclude dye from stretched fibers. Right: graph shows the mean (±SEM) area of dye-positive fibers expressed as the percentage of the total CSA of muscle sections. (C) Single session performance (left) and total running time (right) relative to four weekly treadmill trials with exhaustive exercise protocol. The mdx mice injected with saline, mAAV8-EGFP, or mAAV8-Vp16-Jazz at day 5 after birth were tested at 3 months of age. For each group, lines indicate the mean duration of running time during each trial (left), and columns indicate the cumulative running time over the four consecutive trials (right). Number of animals was 10 for each experimental group. The mAAV8-Vp16-Jazz-treated mdx mice showed a significant improvement of exercise performance as compared to mdx and to mAAV8-EGFP-treated mdx mice. Statistical analysis with “unpaired t-test” showed a significant main effect of treatment on mice performance indicated by **P<0.01 and ***P<0.001. (D) Evan's blue dye (EBD) uptake was used to compare skeletal muscle membrane integrity after exercise. Top: abdominal muscle and diaphragm muscles from untreated and mAAV8-EGFP-treated (controls) or mAAV8-Vp16-Jazz-treated mdx mice were monitored for EBD uptake by fluorescence microscope. EBD uptake (light shading) was significantly higher in sections of control mdx mice as compared with mAAV8-Vp16-Jazz-treated mdx mice. Bottom: Evans blue uptake was also scored as a percentage of Evans blue positive myofibers. *P<0.05 and ***P<0.001 indicate statistical significance by t-test. Scale Bar: 50 μm. Vp16-Jazz is abbreviated as “Jazz” in FIGS. 4A-4D.

FIGS. 5A-5E show expression of modified human zinc finger proteins JZif1 and JZif2 and up-regulation of utrophin expression by these proteins in mouse and human cells. (A) Western blot analysis of C2C7 mouse myotube cells infected with mAAV8-EGFP, mAAV8-JZif1, mAAV8-JZif2, or combinations thereof. JZif1 and JZif2 proteins were detected with a commercial anti-Zif268 (Egr1) antibody (sc-189, Santa Cruz). Expression of EGFP, JZif1, or JZif2 is under the control of the human alpha-actin promoter. (B) Top left: representative bright field image of differentiated C2C7 cells infected with mAAV8-EGFP. Bottom left: EGFP fluorescence signal of C2C7 cells shown in top left panel. Top right: representative indirect immunofluorescence image of differentiated C2C7 myotubes infected with mAAV8-JZif1 (performed as described in Example 1). Detection was with rabbit polyclonal antibody anti-Zif268/EGR-1. Bottom right: Merge of Zif268 signal shown in top panel with Hoechst-stained nuclei (not visible in grayscale). (C) Western blot analysis of utrophin protein levels in HeLa cells (human) induced to myogenic differentiation by transfection with a plasmid expressing MyoD (control) that were co-transfected with mAAV-JZif1, mAAV-JZif2, or both. Detection of laminin and α-tubulin was used to normalize the amount of proteins. (D) Western blot analysis of mouse 3T3 fibroblasts induced to myogenic differentiation by transfection with a plasmid expressing MyoD, and co-transfected with mAAV-JZif1. Detection was with the indicated antibodies. Left panels: Expression of JZif1 in 3T3 fibroblasts. Detection of β-actin was used to normalize the amount of proteins. Right panels: Western blot analysis of utrophin protein levels. Detection of laminin was use to normalize the amount of proteins. (E) Western blot analysis of utrophin protein levels in heart muscles isolated from 2-month old mAAV8-EGFP, mAAV8-JZif1, or mAAV8-JZif2-treated mdx mice. Detection of laminin was used to normalize the amount of proteins.

FIG. 6 is a histogram showing the fold increase in utrophin mRNA transcripts, quantified by real-time PCR, in skeletal (abdomen, diaphragm) muscle and cardiac muscle from 6-week old mdx mice untreated or infected with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2. Control β2-microglobulin mRNA transcripts were quantified as a control. Shown is the mean gene expression ratio between utrophin and β2-microglobulin (β2M). Error bars indicate the standard deviation (SD). Vp16-Jazz is abbreviated as “Jazz” in FIG. 6.

FIG. 7 is a histogram showing the fold-induction of luciferase expression activity (Luc activity) in HeLa cells transfected by the pXP-Luc (pXP) construct, carrying the luciferase gene under regulation of a portion of the utrophin promoter “A”. The HeLa cells were co-transfected with pXP and the indicated plasmid carrying JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly, Vp16-Jazz, Jazz (zinc fingers alone), the activation domain of Zif268 (Zif268AD), or the activation domain of Vp16 alone (VP16AD), each under regulation of the cytomegalovirus (CMV) promoter. Luciferase induction was compared to that achieved in HeLa cells transfected with the pGL2-Prom construct, carrying the luciferase gene under regulation of the pGL2-SV40 promoter (Promega), alone or co-transfected with JZif1 under control of the CMV promoter. The data are presented as the means±SD of at least three independent experiments that were performed in triplicate.

FIG. 8 is a histogram showing the fold of induction of luciferase expression activity in murine C2C7 myogenic cells transfected with the pXP-Luc construct, carrying the luciferase gene under regulation of a portion of the utrophin promoter “A”. Myogenic cells were co-transfected with plasmids carrying JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly, Vp16-Jazz, or wild type Zif268, all under regulation of the alpha-actin promoter. Luciferase activity was assayed after 2 days of differentiation (in DMEM containing 2% fetal bovine serum). The data are presented as the means±SD of at least two independent experiments that were performed in triplicate.

FIG. 9 is a histogram showing the fold-induction of luciferase expression activity (Luc activity) in murine C2C7 myogenic cells transfected with the pXP construct, carrying the luciferase gene under regulation of a portion of the utrophin promoter “A”. Myogenic cells were co-transfected with a plasmid carrying Vp16-Jazz under the regulation of the myosin light chain (MLC) promoter or Vp16-Jazz under the regulation of the alpha-actin promoter. Luciferase activity was measured after two days of incubation in differentiation medium (DMEM containing 2% fetal bovine serum). The data are presented as the means±SD of three independent experiments that were performed in triplicate.

FIGS. 10A-10B are graphs showing the effects of changes in voltage on isotonic contractions in EDL (A) and abdominal muscles (B), from 6 week old control WT C57BL6 mice, mdx-untreated mice, and mdx mice systemically injected with mAAV8-Vp16-Jazz, mAAV8-JZif1, mAAV8-JZif2, mAAV8-Vp16-CJ7-UtroUp and mAAV8-Vp16-Bagly as described in Example 1. The muscle's contractile force (tension) for each voltage was determined and plotted as a percentage of maximal contraction. Each point represents the mean of two EDL muscles (A) or one abdominal muscle strip (B) for each animal per group as indicated in the legend, with error bars denoting standard error of the mean (SEM). The increased contractile force exerted by EDL or abdominal muscles from wild-type mice or mice infected with mAAV8-Jazz, mAAV8-JZif1, mAAV8-Vp16-CJ7-UtroUp, and mAAV8-Vp16-Bagly compared to untreated mdx mice was statistically significant as calculated by Student's t-test (p<0.05) over a range of voltage values.

FIG. 11 shows analysis of muscle endurance: 6 week old wild type (WT) C57BL6, mdx, and mdx mice treated at 5 days post birth with mAAV8-Vp16-Jazz, mAAV8-JZif1, mAAV8-JZif2, mAAV8-Vp16-CJ7-UtroUp, and mAAV8-Vp16-Bagly were subjected to treadmill analysis by the Accelerated Method, as described in Example 1. Panel on left depicts the average running time for each of the three treadmill runs post the initial habituation run. The number of animals used for averaging in each category is marked in parentheses. Panel on the right depicts the average total running time for all three runs. The number of animals used for averaging in each category is marked in parentheses. All data are shown as mean±SEM. Data for total time running during treadmill test, were statistically analyzed by one-way analysis of variance (ANOVA) followed by pairwise post-hoc comparisons using Bonferroni-Dunn test. (**) indicates p<0.01, (***) indicates p<0.001. The mdx mice treated with JZif2 were not significantly different compared to mdx mice. All treated mice that are significantly different compared to mdx mice were not significantly different compared to WT.

FIGS. 12A-12C show schematic depictions of wild type Zif268 and modified transcription factors JZif1 and JZif2. (A) Top: On the left side is a schematic representation of the natural human three zinc finger motif transcription factor Zif268, followed by a representation of the two novel artificial three zinc finger motif transcription factors named “JZif1” and “JZif2”. Zif268, JZif1 and JZif2 genes have been cloned and are shown here in the mAAV vector under the control of the muscle-specific human alpha-actin promoter. On the right side the corresponding DNA target sequences for each protein is shown. Bottom: Amino acid sequences of Zif268, JZif1, and JZif2 zinc finger motifs are shown and aligned. Amino acid positions −1, +3, and +6 of the alpha helix, crucial for DNA binding specificity, are indicated and the amino acids are represented in bold. Underlined amino acid positions represent variations in JZif1 and JZif2 from the Zif268 prototype zinc finger motif. Note that zinc finger/DNA recognition involves an anti-parallel arrangement of the protein: the amino-terminal region is involved in 3′ DNA contact, while the carboxyl-terminal region is involved in 5′ DNA recognition (indicated by crossed lines between residues in the amino acid sequence of the zinc finger motifs and base pairs of the target DNA sequences). (B) shows the amino acid sequence of JZif1 (SEQ ID NO:38). The three zinc finger motifs (ZF1, ZF2, and ZF3) are highlighted in grey. The amino acid positions of the zinc finger motif alpha-helix are indicated above the corresponding amino acid residues (positions −1, 1, 2, 3, 4, 5, 6, 7, 8, and 9). Amino acid positions 1, +3, and +6 of the zinc finger motif alpha-helix are represented in bold. Underlined amino acid residues represent variations in JZif1 from the Zif268 prototype zinc finger motif. (C) shows the amino acid sequence of JZif2 (SEQ ID NO:39). The three zinc finger motifs (ZF1, ZF2, and ZF3) are highlighted in grey. The amino acid positions of each zinc finger motif alpha-helix are indicated above the corresponding amino acid residues (positions −1, 1, 2, 3, 4, 5, 6, 7, 8, and 9). Amino acid positions 1, +3, and +6 of the zinc finger motif alpha-helix are represented in bold. Underlined amino acid residues represent variations in JZif2 from the Zif268 prototype zinc finger motif.

FIG. 13 shows a sequence comparison of a region of the human and mouse utrophin promoter “A”. The N-box core sequence and conserved E-box are indicated. The numbering corresponds to: human utrophin promoter sequence EMBL accession no. X95523 and mouse utrophin promoter sequence: EMBL accession no. X95524. The eighteen base pair DNA target sequence recognized by exemplary artificial transcription factors of the invention is underlined. The first nine base pairs are in bold characters and are recognized by Jazz, JZif1, and Bagly. Bagly in addition to these nine base pairs extends its target sequence on the human sequence three additional base pairs in the 5′ direction (shown in Italic characters). The second nine base pairs (shown in underlined, bold Italic characters), are recognized by JZif2. UtroUp binds the entire eighteen base pair. DNA target sequence (underlined and bold characters).

FIG. 14 is a histogram depicting the mean increase in utrophin protein levels in diaphragm muscles isolated from 6-week old mdx mice that were treated with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2 viral vectors, relative to utrophin levels in diaphragm muscles isolated from untreated mdx mice. Utrophin protein expression was assessed by Western blot using the mouse polyclonal anti-utrophin antibody A01 (Abnova). Detection of laminin was used to normalize the amount of proteins. Relative values were measured by densitometric analysis using ImageJ software. At least two animals were tested per group. Vp16-Jazz is abbreviated “Jazz” in FIG. 14.

FIGS. 15A-15B show indirect immunofluorescence images of sections of diaphragm (FIG. 15A) and abdomen (FIG. 15B) muscles. Five-day old mdx mice were intraperitoneally injected with PBS (mdx-control) or with mAAV8-Jazz, mAAV8-JZif1, or mAAV8-JZif2 and examined at 6 weeks of age. The sections were immunostained with anti-utrophin monoclonal antibody (BD Lab Transduction, middle panels). The extracellular matrix is counterstained with the anti-laminin polyclonal antibody (left panels). Nuclei are stained with DAPI (right panels). All the images were taken at 20× magnification.

FIGS. 16A-16E shows a histogram of Evan's blue dye (EBD) uptake used to compare skeletal muscle membrane integrity at the end of the third session of treadmill exercise using the accelerated protocol (see Example 1 and FIG. 11). Evan's blue dye uptake was scored as percentage of EBD positive myofibers in tibialis anterioris (FIG. 16A), heart (FIG. 16B), quadriceps (FIG. 16C), abdomen (FIG. 16D), and diaphragm muscles (FIG. 16E) from untreated mdx mice and mdx mice systemically injected with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2 (mean of four slices of each muscle sections counted). Evans' blue dye uptake was detected by fluorescence microscopy. Error bars indicate SEM. Under each column in the histogram are representative images of EBD uptake for each treatment.

DETAILED DESCRIPTION OF THE INVENTION

Dystrophinopathies including DMD and BMD are caused by a lack of functional dystrophin protein in muscle cells. Utrophin is considered the autosomal homologue of dystrophin because dystrophin and utrophin share structural and functional motifs across the length of the proteins. Dystrophin and utrophin both function as connecting bridges between cytoskeletal actin, the cell membrane, and the extracellular matrix, via proteins which are collectively called DAPs (dystrophin associated proteins). Utrophin is mainly expressed in the fetus and its expression is reduced post-partum, whereas dystrophin is mainly expressed after birth. In adults the localization of utrophin is limited to the neuromuscular junction, while dystrophin localizes along the entire length of the sarcolemma. Studies carried out on transgenic mdx mice, a well-accepted animal model for DMD that lacks dystrophin, have shown that overexpression of utrophin, which is accompanied by redistribution of the protein across the entire sarcolemma, causes a marked improvement in the dystrophic phenotype of these mice.

To achieve utrophin up-regulation, we engineered artificial zinc finger-based transcription factor (ZF-ATF) fusion proteins capable of binding to the sequences defined by SEQ ID NO:9-14 and activating transcription from the promoter “A” of both the human and mouse utrophin genes. We generated a transgenic mouse model that overexpresses such a fusion protein containing three zinc finger motifs and a transcriptional activation element named “Vp16-Jazz” which recognizes and binds the sequence defined by SEQ ID NO:9. Vp16-Jazz transgenic mice are viable, fertile, and live to adulthood without any health problems. To evaluate the therapeutic potential of the artificial Vp16-Jazz gene, we generated an mdx/Vp16-Jazz mouse model by crossing Vp16-Jazz transgenic mice (Vp16-Jazz mice) with dystrophin-deficient mdx mice. The resulting mdx/Vp16-Jazz mice displayed expression of Vp16-Jazz, significant up-regulation of utrophin mRNA and protein expression and a strong amelioration of the dystrophic phenotype, as described in Italian patent application RM2005A000493.

The approach of the present invention is to up-regulate (i.e., increase) the expression level of the dystrophin-related gene utrophin in the muscles of DMD patients to functionally rescue (i.e., complement) the lack of dystrophin function. This can be done by contacting the utrophin gene promoter with an artificial transcription factor (ATF) of the invention, for example, a fusion protein or modified transcription factor capable of increasing utrophin expression. We have created muscle-specific recombinant adeno-associated vectors capable of promoting the expression of exogenous genes such as the fusion proteins and modified transcription factors of the invention both in skeletal muscle and in cardiac muscle, for use in treating diseases which severely affect both skeletal muscles and cardiac muscles (e.g., DMD). Moreover, the adeno-associated vectors of the present invention have the advantages of greater efficiency in the expression of the exogenous gene (both in slow muscle fibers and in fast muscle fibers), greater stability of the transcript and greater persistence of expression over time. Using the vectors of the present invention, we have discovered that recombinant adeno-associated virus (recombinant AAV) delivery of fusion proteins and modified transcription factors capable of increasing utrophin expression in muscle significantly ameliorates the dystrophic phenotype of mdx dystrophin-deficient mice. Furthermore, the delivery of these artificial transcription factors rescues both the histopathological defects observed in dystrophic mdx muscle tissue as well as in vivo muscle function, both in terms of muscle strength and endurance. Accordingly, the compositions including the vectors and fusion proteins of the present invention can be used to treat muscle defects, particularly DMD.

AAV Vectors and Compositions

In certain embodiments of this invention, a nucleic acid sequence, or fragment thereof, encoding a gene, e.g., an artificial transcription factor (e.g., a fusion protein or modified transcription factor) capable of increasing utrophin expression, is delivered to muscle cells by means of a viral vector, of which many are known and available in the art. In preferred embodiments, the artificial transcription factor is delivered to both skeletal and cardiac muscle. The therapeutic vector is desirably non-toxic, minimally immunogenic (i.e., elicits a very mild immune response, if any), easy to produce, and efficient in protecting and delivering DNA into the target cells. In particular embodiments, the viral vector is a recombinant adeno-associated vector (AAV). In other embodiments, the invention provides a therapeutic composition comprising an AAV comprising an artificial transcription factor (e.g., a fusion protein or a modified transcription factor) under the control of a muscle-specific promoter.

The use of AAVs is a common mode of exogenous delivery of DNA as it is relatively non-toxic, provides efficient gene transfer, and can be easily optimized for specific purposes. More than 30 naturally occurring serotypes of AAV are available. Many natural variants in the AAV capsid exist, allowing identification and use of an AAV with properties specifically suited for muscle cells. AAV viruses may be engineered by conventional molecular biology techniques, making it possible to optimize these particles for cell specific delivery of nucleic acid sequences, for minimizing immunogenicity, for tuning stability and particle lifetime, for efficient degradation, etc. Among the serotypes of AAVs isolated and characterized from human or non-human primates (NHP), human serotype 2 is the first AAV that was developed as a gene transfer vector; it has been widely used for efficient gene transfer experiments in different target tissues and animal models. Other AAV serotypes include, but are not limited to, AAV1, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, and AAVrh10. See, e.g., International Patent Application WO 2005/033321, for a discussion of various AAV serotypes.

In several embodiments, expression of artificial transcription factors of the present invention or other genes can be achieved in muscle cells through delivery by recombinantly engineered AAVs or artificial AAVs that comprise nucleic acid sequences encoding said artificial transcription factors or genes that are operably linked to muscle-specific promoters. In certain preferred embodiments, these AAVs are formulated as pharmaceutical compositions for treatment of muscle defects.

Muscle-specific expression vectors are described in US Patent Application Publication No. 2013/0136729, which describes the combination of AAV9 and muscle-specific CK6 promoter as a particularly efficient system for systemically carrying genes of interest into ischemic skeletal muscle. This publication further describes the combination of AAV9 and cardiac troponin C promoter as a system that is particularly effective in cardiomyocytes and skeletal myocytes with ischemic damage. International Patent Application WO 2005/118611 describes the delivery of microutrophin via adeno-associated vectors in the mdx dystrophic mouse model and incorporation of promoters that are active in muscle such as skeletal β-actin promoters, myosin light chain 2A promoters, dystrophin promoters and muscular creatine kinase promoters.

Expression vectors for the expression of exogenous genes such as zinc finger transcription factors in muscle and non-muscle tissue are also described in, for example, U.S. Pat. No. 8,304,235 (which describes the artificial transcription factor UtroUp (encoding a 6 zinc finger repeat designed to specially bind to utrophin promoter A) fused with the transcriptional activation domain “Vp16” from the Herpes simplex virus under the control of regulatory sequences of cytomegalovirus (CMV)) and the expression thereof via the eukaryotic vector pRK5 (Clontech), Italian Patent Application RM2005A000493, (which describes the artificial transcription factor Vp16-Jazz and the expression thereof via the vector pMEX under the control of the muscle-specific promoter and enhancer region of the murine myosin light chain (MLC) gene). Using these expression vectors, the exogenous gene is expressed mainly in skeletal muscle tissues or in cardiac muscle tissues, but not in both. The present invention advantageously allows for expression of exogenous genes in both skeletal and cardiac muscle tissues.

Desirable AAV fragments for assembly into vectors include the cap proteins, including vp1, vp2, vp3, and hypervariable regions, the rep proteins, including rep78, rep68, rep52, and rep40, and the sequences encoding these proteins. These fragments may be readily utilized in a variety of vector systems and host cells. Such fragments maybe used, alone, in combination with other AAV serotype sequences or fragments, or in combination with elements from other AAV or non-AAV viral sequences. As used herein, artificial AAV serotypes include, e.g., AAV with a non-naturally occurring capsid protein. Such an artificial capsid may be generated by any suitable technique, using a selected AAV sequence (e.g., a fragment of a vp1 capsid protein) in combination with heterologous sequences which may be obtained from a different selected AAV serotype, non-contiguous portions of the same AAV serotype, from a non-AAV viral source, or from a non-viral source. An artificial AAV serotype may be, without limitation, a pseudotyped AAV capsid, a chimeric AAV capsid, a recombinant AAV capsid, or a “humanized” AAV capsid. Pseudotyped vectors, wherein the capsid of one AAV is replaced with a heterologous capsid protein, are useful in the invention.

In one embodiment, the vectors useful in compositions and methods described herein contain, at a minimum, sequences encoding a selected AAV serotype capsid, e.g., an AAV8 capsid, or a fragment thereof. In another embodiment, useful vectors contain, at a minimum, sequences encoding a selected AAV serotype rep protein, e.g., AAV8 rep protein, or a fragment thereof. Optionally, such vectors may contain both MV cap and rep proteins. In vectors in which both AAV rep and cap are provided, the AAV rep and AAV cap sequences can both be of one serotype origin, e.g., all AAV8 origin. In other embodiments, the vectors useful in compositions and methods described herein contain, at a minimum, sequences encoding an AAV6 capsid, or a fragment thereof. In some embodiments, useful vectors contain sequences encoding an AAV6 rep protein, or a fragment thereof. In other embodiments, in vectors in which both AAV rep and cap are provided, the AAV rep and AAV cap sequences can be both of AAV6 origin.

Alternatively, vectors may be used in which the rep sequences are from an AAV serotype which differs from that which is providing the cap sequences. In one embodiment, the rep and cap sequences are expressed from separate sources (e.g., separate vectors, or a host cell and a vector). In another embodiment, these rep sequences are fused in frame to cap sequences of a different AAV serotype to form a chimeric AAV vector, such as AAV2/8 described in U.S. Pat. No. 7,282,199, which is incorporated by reference herein.

A suitable recombinant AAV is generated by culturing a host cell which contains a nucleic acid sequence encoding an AAV serotype capsid protein, or fragment thereof, as defined herein; a functional rep gene; a transgene nucleic acid comprising AAV inverted terminal repeats (ITRs), a muscle-specific promoter sequence, and an artificial transcription factor (e.g., a fusion protein or modified human transcription factor) sequence; and sufficient helper functions to permit packaging of the transgene nucleic acid into the AAV capsid protein. The components required to be cultured in the host cell to package an AAV minigene in an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more of the required components (e.g., transgene nucleic acid sequences, rep sequences, cap sequences, and/or helper functions) may be provided by a stable host cell which has been engineered to contain one or more of the required components using methods known to those of skill in the art. AAV vectors of the present invention can be produced, for example, by triple transfection of subconfluent HEK293 cells by three plasmids: AAV cis-plasmid containing the gene of interest, AAV trans-plasmid containing AAV rep and cap genes, and an adenovirus helper plasmid, for example pDF6.

Such a stable host cell may contain the required component(s) under the control of an inducible promoter. However, the required component(s) may be under the control of a constitutive promoter. Examples of suitable inducible and constitutive promoters are provided herein, in the discussion below of regulatory elements suitable for use with the transgene. In still another alternative, a selected stable host cell may contain selected components under the control of a constitutive promoter and other selected components under the control of one or more inducible promoters. For example, a stable host cell may be generated which is derived from 293 cells (which contain E1 helper functions under the control of a constitutive promoter), but which contains the rep and/or cap proteins under the control of inducible promoters. Still other stable host cells may be generated by one of skill in the art.

The transgene, rep sequences, cap sequences, and helper functions required for producing the recombinant AAV of the invention may be delivered to the packaging host cell in the form of any genetic element which transfers the sequences carried thereon. The selected genetic element may be delivered by any suitable method, including those described herein. The methods used to construct any embodiment of this invention are known to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. Similarly, methods of generating recombinant AAV virions are well known and the selection of a suitable method is not a limitation on the present invention. See, e.g., Fisher et al., J. Virol. 70:520-532, 1992 and U.S. Pat. No. 5,478,745, among others. These publications are incorporated herein by reference.

Unless otherwise specified, the AAV ITRs, and other selected AAV components described herein, may be readily selected from among any AAV serotype, including, without limitation, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, or other known and unknown AAV serotypes. These ITRs or other AAV components may be readily isolated using techniques available to those of skill in the art from an AAV serotype. Such AAV may be isolated or obtained from academic, commercial, or public sources (e.g., the American Type Culture Collection, Manassas, Va.). Alternatively, the AAV sequences may be obtained through synthetic or other suitable means by reference to published sequences such as are available in the literature or in databases such as, e.g., GenBank, PubMed, or the like.

A transgene of the present invention comprises, for example, an artificial transcription factor (e.g., a fusion protein or modified transcription factor) nucleic acid sequence (e.g., Vp16-Jazz, Vp16-Bagly, Vp16-CJ7-UtroUp, JZif1, or JZif2) or other gene desired to be delivered to muscles, as described above, and its regulatory sequences, and 5′ and 3′ AAV inverted terminal repeats (ITRs). In one desirable embodiment, the ITRs of AAV serotype 8 are used. However, ITRs from other suitable serotypes may be selected. It is this transgene which is packaged into a capsid protein and delivered to a selected host cell.

The regulatory sequences include conventional control elements which are operably linked to the transgene in a manner which permits its transcription, translation and/or expression in a cell transfected with the vector or infected with the virus produced by the invention. Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); sequences that enhance protein stability; sequences that confer nuclear localization (e.g., nuclear localization sequences) and when desired, sequences that enhance secretion of the encoded product. A great number of expression control sequences are known in the art and may be utilized. Poly A sequences may be derived from many suitable species, including, without limitation SV-40, humans, and bovines.

The regulatory sequences useful in the constructs of the present invention may also contain an intron, desirably located between the promoter/enhancer sequence and the gene. One desirable intron sequence is derived from SV-40, and is a 100 bp mini-intron splice donor/splice acceptor referred to as SD-SA. In a preferred embodiment, the intron comprises the first intron of the human alpha-actin gene, including the splice donor site, as well as part of the second intron of the human beta-globin gene and part of the third exon of the human beta-globin gene, including the splice acceptor site.

In other embodiments, the recombinant AAV vectors may include accessory functional elements including cap signals, sequences coding for epitope tags (e.g., haemmaglutinin (HA) tag, myc tag, maltose-binding protein (MBP) tag, green fluorescent protein (GFP) or any other fluorescent protein, etc.), and/or multicloning sites which contain a plurality of restriction enzyme cutting sites that make it possible to insert into the vector any gene coding for a protein of interest. The amino acid sequence of an exemplary myc tag is shown in SEQ ID NO:75.

Another regulatory component of the recombinant AAV useful in the invention is an internal ribosome entry site (IRES). An IRES sequence, or other suitable systems, may be used to produce more than one polypeptide from a single gene transcript. An IRES (or other suitable sequence) is used to produce a protein that contains more than one polypeptide chain or to express two different proteins from or within the same cell. An exemplary IRES is the FGF-1 internal ribosome entry sequence, which supports transgene expression in skeletal muscle cells. See, e.g., Delluc-Claviéres et al., Gene Ther. 15(15):1090-1098, 2008. Preferably, the IRES is located 3′ to the transgene in the recombinant AAV vector.

The selection of the promoter to be employed in the recombinant AAV may be made from among a wide number of constitutive, inducible, cell-type specific, or tissue-type specific promoters that can express the selected transgene in the desired muscle cell. In a preferred embodiment, the promoter is muscle-specific. In particularly preferred embodiments, the promoter is specific for expression of the transgene in skeletal and cardiac muscle cells.

The promoter(s) used in the present invention may be derived from any species. In one embodiment, the promoter is of a small size, under 1000 bp, due to the size limitations of the AAV vector. In another embodiment, the promoter is under 400 bp. Examples of constitutive promoters useful in the invention include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer), the SV40 promoter, the dihydrofolate reductase promoter, the chicken β-actin (CBA) promoter, the phosphoglycerol kinase (PGK) promoter, the EF1 promoter (Invitrogen), and the immediate early CMV enhancer coupled with the CBA promoter.

Inducible promoters which can be used in the present invention allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. Examples of inducible promoters regulated by exogenously supplied compounds, include, the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system; the ecdysone insect promoter, the tetracycline-repressible system, the tetracycline-inducible system, the RU486-inducible system and the rapamycin-inducible system. Other types of inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only. Any type of inducible promoter which is tightly regulated and is specific for the particular target cell type may be used.

Other regulatory sequences useful in the invention include enhancer sequences. Enhancer sequences useful in the invention include the IRBP enhancer, immediate early cytomegalovirus enhancer, one derived from an immunoglobulin gene or SV40 enhancer, the cis-acting element identified in the mouse proximal promoter, etc. Other enhancer sequences are also well-known in the art.

In preferred embodiments, the transgene of the invention comprises a muscle-specific promoter. For example, a muscle specific promoter may increase expression of a gene in a muscle at least 2-fold, 3-fold, 4-fold, 5-fold, 10-fold, 15-fold, 20-fold, 30-fold, 50-fold, 100-fold or more compared to a reference non-muscle tissue. Exemplary muscle-specific promoters include but are not limited to: alpha-actin, cardiac troponin C, myosin light chain 2A, skeletal beta-actin, CK6, dystrophin, muscular creatine kinase, dMCK, tMCK, enh348MCK, synthetic C5-12 (Syn), Myf5, MLC1/3f, MyoD1, Myog, and Pax7. See, for example, U.S. Patent Application 2011/0212529, McCarthy et al., Skeletal Muscle 2:8, 2012; and Wang et al., Gene Ther. 15:1489-1499, 2008.

Selection of these and other common vector and regulatory elements are conventional and many such sequences are available. See, e.g., Sambrook et al., supra and references cited therein, and Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989. Of course, not all vectors and expression control sequences will function equally well to express all of the transgenes of this invention. However, one of skill in the art may make a selection among these, and other, expression control sequences without departing from the scope of this invention,

Expression Vectors for Production of Muscle-Specific Recombinant AAV

The invention provides expression vectors for the production of recombinant AAV that are muscle-specific (e.g., viral particles that comprise a polynucleotide sequence comprising a muscle-specific promoter and an artificial transcription factor (e.g., a fusion protein or a modified transcription factor)). For example, in one embodiment, the expression vectors described by SEQ ID NOs:1 or 2 or SEQ ID NOs:62-65 can be used to produce the recombinant AAV). In a particular embodiment, the expression vector comprises the nucleotide sequence called SEQ ID NO:1. The expression vector according to this embodiment is referred to as “muscle AAV” (“mAAV”). The functional elements of mAAV are further described below.

Located at the 5′ end is the Left-Inverted Terminal Repeat (L-ITR) sequence of the adeno-associated virus, while located at the 3′ end is the Right-Inverted Terminal Repeat (R-ITR) sequence of the adeno-associated virus. These inverted terminal repeat sequences are known per se and are described in the literature. In SEQ ID NO:1 they are respectively located between the nucleotide positions from 1 bp to 141 bp (L-ITR) and from 2899 bp to 3040 bp (R-ITR).

Between nucleotide positions 156 bp and 2219 bp of SEQ ID NO:1 there is also a transcriptional regulatory region which includes the promoter of the human alpha-actin gene, part of the first intron of the human alpha-actin gene, including the splice donor site, as well as part of the second intron of the human beta-globin gene and part of the third exon of the human beta-globin gene, including the splice acceptor site.

Between nucleotide positions 2219 and 2264 bp of SEQ ID NO:1 there is also a polylinker, or multiple cloning site, which contains a plurality of restriction enzyme cutting sites that make it possible to insert into the expression vector any gene coding for a protein of interest.

In addition to the aforementioned essential functional elements, there may also be accessory functional elements such as for example a cap signal, a sequence coding for a tag (for example Myc-tag), a polyadenylation consensus sequence, etc.

The main restriction sites, the polylinker, the regulatory region and the elements of the mAAV expression vector of SEQ ID NO:1 are shown in Table 1 below.

TABLE 1 Functional elements of mAAV expression vector Position Main functional elements (bp) Main 2 NotI sites  149 restriction SEQ ID NO: 27 2899 sites GCGGCCGC 1 MluI site  155 SEQ ID NO: 28 ACGCGT Polylinker SEQ ID NO: 7 2219-2264 ATCGATGGGAATT CCGGGATCCGGTC GACCGTACGTACA AGATCT Regulatory Alpha-actin  156-2219 region promoter and part of the intron comprising the beta-globin gene acceptor splice donor site: 1606 SEQ ID NO: 29 GCCCAGGTAGGG splice acceptor site: 2154 SEQ ID NO: 30 CCCACAGCTCCT Elements  Left-ITR   1-141 of the Right-ITR 2899-3040 AAV vector

As indicated above, due to the presence of the multiple cloning site, the expression vector of the invention is able to accommodate a sequence coding for any gene of interest, such as, for example a gene coding for a reporter protein or a gene coding for a protein to be specifically expressed in skeletal and cardiac muscle tissue. Particular proteins to be specifically expressed in skeletal and cardiac muscle tissue are artificial transcription factors capable of increasing expression of the utrophin gene.

The mAAV expression vector described herein can be used to produce recombinant AAV (e.g., viral particles that comprise a polynucleotide comprising a muscle-specific promoter and an artificial transcription factor of the invention). Polynucleotides that are portions of the mAAV expression vectors described herein are packaged into recombinant AAV viral particles. For example, the viral particles can include a polynucleotide sequence defined by SEQ ID NO:83-87, corresponding to portions of mAAV-Vp16-Jazz, mAAV-Vp16-Bagly, mAAV-Vp16-CJ7-UtroUp, mAAV-JZif1, and mAAV-JZif2, respectively. Viral particles are described herein by the expression vector used for their production and by the serotype. For example, the viral particles referred to as mAAV8-Vp16-Jazz and mAAV6-Vp16-Jazz have a capsid serotype 8 and serotype 6, respectively, and can be produced using the mAAV-Vp16-Jazz expression vector and methods described herein or known in the art.

DNA Binding Elements

The fusion proteins or modified transcription factors of the invention comprise DNA binding elements. In preferred embodiments, the DNA binding elements comprise zinc finger motifs that specifically bind to defined DNA sequences. In the most preferred embodiments, the DNA binding elements specifically bind to defined sequences in the utrophin “A” promoter. A DNA binding element of the instant invention may be derived or isolated from zinc finger motifs, which are well known in the art. Preferably, the zinc finger motif is derived from a Cys₂-His₂ type zinc finger. A zinc finger DNA binding element can be derived or produced from a wild type zinc finger-containing polypeptide by truncation or expansion, or as a variant of a wild type-derived polypeptide by a process of site directed mutagenesis, or by a combination of the procedures. See e.g., U.S. Pat. Nos. 6,242,568; 6,140,466; and 6,140,081. The term “truncated” refers to a zinc finger-nucleotide binding polypeptide that contains less that the full number of zinc finger motifs found in the native zinc finger binding polypeptide or that has been deleted of non-desired sequences. For example, truncation of the zinc finger-nucleotide binding protein TFIIIA, which naturally contains nine zinc finger motifs, might be a polypeptide with only zinc fingers one through three. Expansion refers to a zinc finger polypeptide to which additional zinc finger motifs have been added. For example, TFIIIA may be extended to 12 fingers by adding 3 zinc finger motifs.

In addition, a zinc finger DNA binding element may include zinc finger motifs from more than one wild type polypeptide, thus resulting in a “hybrid” zinc finger polypeptide. The term “mutagenized” refers to a zinc finger polypeptide that has been obtained by performing any of the known methods for accomplishing random or site-directed mutagenesis of the DNA encoding the protein. Examples of known zinc finger proteins that can be truncated, expanded, and/or mutagenized according to the present invention in order to alter the function of a zinc finger-nucleotide binding motif include TFIIIA and Zif268. Other zinc finger-containing nucleotide binding polypeptides are well known to those of skill in the art.

A DNA binding element of the present invention typically comprises a plurality of DNA binding domains. Preferably, the DNA binding domains are zinc-finger motifs. Preferably, the DNA binding element contains from 2, 3, 4, 5, 6, 7, 8, 9, or 10 such motifs, more preferably from 2 to 6 such motifs and, most preferably, 3 such motifs. The DNA binding domains are operably linked to each other. In one embodiment, the DNA binding domains are directly linked or bonded together via well known peptide linkages. In another embodiment, the DNA binding domains are operatively linked using a peptide linker containing from 5 to 50 amino acid residues. Preferably, the linker contains from 5 to 40 amino acid residues, more preferably from 5 to 30 amino acid residues and, even more preferably from 5 to 15 amino acid residues. The linkers are preferably flexible. Exemplary linkers are set forth, for example, in US Patent Application Publication No. 2007/0020627.

DNA binding elements used in the invention can be naturally-occurring or non-naturally occurring. Naturally-occurring zinc finger DNA binding domains are well known in the art. In a preferred embodiment, at least one DNA binding domain of a present DNA binding element is non-naturally occurring. Each of the DNA binding zinc finger motifs is preferably designed and made to specifically bind nucleotide target sequences corresponding to the formula 5′-NNN-3′, where N is any nucleotide (i.e., A, C, G or T). Such DNA binding domains are well known in the art. See, e.g., U.S. Pat. Nos. 6,242,568, 6,140,466 and 6,140,081. A known “recognition code” that relates the amino acids of a single zinc finger motif to its associated DNA target can be utilized as a guide for the design of the DNA binding elements of the present invention. See, for example, Corbi et al., Biochem Cell Biol. 82:428-36, 2004; Klug, Q. Rev. Biophys. 31(1):1-21, 2010; Pabo et al., Annu. Rev. Biochem. 70:313-340, 2001; Segal et al., Curr. Opin. Biotechnol. 12(6):632-637, 2001; Klug, Annu. Rev. Biochem. 79:213-231, 2010; and Bhakta et al., Methods Mol. Biol. 649:3-30, 2010. This code can be used for modular assembly of DNA binding elements, for example, by combining three separate zinc finger motifs that can each recognize a 3 bp DNA sequence to generate a three zinc-finger DNA binding element that can specifically recognize a 9 bp target site. Alternatively, screening methods or selection strategies can be used to identify zinc finger sequences that specifically bind to a desired DNA sequence. These methods include, for example, phage display, yeast one-hybrid systems, and bacterial one-hybrid and two-hybrid systems and other methods known to one of skill in the art (see, e.g., Maeder et al., Mol. Cell. 31:294-301, 2008). Combinations of zinc finger motifs that bind to specific DNA sequences can be obtained from commercial sources or using tools provided by the Zinc Finger Consortium.

Structural information about known zinc finger motifs can be used to guide the design of DNA binding elements that bind to a desired target sequence. For example, the structure of a three finger polypeptide-DNA complex derived from the mouse immediate early protein Zif268 (also known as Krox-24) has been solved by X-ray crystallography (see, e.g., Pavletich et al., Science, 252:809-817, 1991). Each finger contains an anti-parallel beta-turn, a finger tip region, and a short amphipathic alpha-helix which, in the case of the zinc finger motifs of Zif268, binds in the major groove of DNA. In addition, the conserved hydrophobic amino acids and zinc coordination by the cysteine and histidine residues stabilize the structure of the individual finger domain. The crystal structure of Zif268 indicates that specific histidine (non-zinc coordinating His residues) and arginine residues on the surface of the alpha-helix participate in DNA recognition. Specifically, the charged amino acids immediately preceding the alpha-helix and at helix positions 2, 3, and 6 (immediately preceding the conserved histidine) participate in hydrogen bonding to DNA guanines. In general, modifications in or near the alpha-helix are more likely to affect DNA binding specificity, whereas modifications in framework regions (e.g., beta turns or linker regions) are less likely to affect DNA binding specificity, while they may or may not affect structural integrity (e.g. proper folding) of the protein.

A zinc finger DNA binding motif of this invention typically comprises a unique heptamer (contiguous sequence of 7 amino acid residues) within the alpha-helix of the motif, which largely determines binding specificity to a target nucleotide. The heptameric sequence can be located anywhere within the α-helical domain but it is preferred that the heptamer extend from position −1 to position 6 as the residues are conventionally numbered in the art. In some embodiments, one or more modifications in the key amino acid positions (−1, +3, and +6) of the zinc finger alpha-helix can enable it to bind the desired DNA target sequence. In some embodiments, one or more modifications in other amino acid positions of the zinc finger alpha-helix can be used to enable it to bind the desired DNA target sequence (e.g., positions +1, +2, +4, or +5). In other embodiments, changes to residues outside of the heptamer can also be introduced to enable the protein to bind to the desired DNA target sequence. A zinc finger motif can include any β-sheet and framework sequences known in the art to function as part of a zinc finger motif. In some embodiments, the β-sheet and/or framework sequences are not substantially modified during the process of modifying a zinc finger domain to bind to a desired target sequence. Therefore, a zinc finger motif of a DNA binding element of the invention (e.g., in a fusion protein or a modified human transcription factor) may have 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more sequence identity to a known zinc finger motif, with only a few different amino acid residues, e.g., 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues modified from the known zinc finger motif. These modifications are typically located in the alpha-helix or in amino acid residues surrounding the alpha-helix. Residues that are involved structural integrity or proper 3 dimensional protein folding, e.g., in zinc coordination, are disfavored for modification. Methods to determine whether a DNA binding element can bind to a particular DNA sequence are well known to those skilled in the art, and include, e.g., electrophoretic mobility shift assays (EMSA), chromatin immunoprecipitation (ChIP), and DNase I protection assays.

Shown in Table 2 is a comparison of the zinc finger motif sequences for the Jazz, Bagly, and UtroUp DNA binding elements. In some embodiments, the zinc finger motifs of the invention have at least 50% sequence identity to any of the zinc finger motifs described in Table 2 or in SEQ ID NOs:54-60; e.g., 50%, 55%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity. In certain embodiments, the zinc finger motifs of the invention have less than 15 amino acid substitutions compared to any of the zinc finger motifs described in Table 2 or in SEQ ID NOs:54-60; e.g., less than 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 substitution(s).

TABLE 2 Comparison of zinc finger (ZF) motif sequences of selected DNA binding elements DNA DNA Binding # ZF target Element Motifs ZF Motif Sequences sequence Jazz 3 ZF1: CPVESCDRRFSRSDELT GCTGCTGC RHIRIH (SEQ ID NO: 54) G ZF2: CRICMRNFSSRDVLRRH (SEQ ID NRTH (SEQ ID NO: 55) NO: 9) ZF3: CDICGRKFASRDVLRRH NRIH (SEQ ID NO: 56) Bagly 4 ZF1: CPVESCDRRFSRSDELT CGGGCTGC RHIRIH (SEQ ID NO: 54) TGCG ZF2: CRICMRNFSSRDVLRRH (SEQ ID NRTH SEQ ID NO: 55) NO: 12) ZF3: CDICGRKFASRDVLRRH (human) NRIH (SEQ ID NO: 56) CCGGCTGC ZF4: CAECGKAFVESSKLKRH TGCG QLVH (SEQ ID NO: 57) (SEQ ID NO: 13) (mouse) UtroUp 6 ZF1: CPVESCDRRFSRSDNLV GCTGCTGC RHIRIH (SEQ ID NO: 58) GGGCTGGG ZF2: CRICMRNFSRSDHLTTH AG NRTH (SEQ ID NO: 59) (SEQ ID  ZF3: CDICGRKFADPGHLVRH NO: 14) NRIH (SEQ ID NO: 60) ZF4: CPVESCDRRFSRSDELT RHIRIH (SEQ ID NO: 61) ZF5: CRICMRNFSSRDVLRRH NRTH (SEQ ID NO: 55) ZF6: CDICGRKFASRDVLRRH NRIH (SEQ ID NO: 56) Fusion Proteins

The fusion proteins described herein generally contain from about e.g., 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000 or more amino acid residues. It will be understood by those of ordinary skill in the art that the polypeptides can also be prepared by other means including, for example, recombinant techniques, or by synthesis. Examples of appropriate cloning and sequencing techniques, and instructions sufficient to direct persons of skill through many cloning exercises are found in Sambrook et al., supra. Product information from manufacturers of biological reagents and experimental equipment, such as the SIGMA Chemical Company (Saint Louis, Mo.), and New England BioLabs (Ipswich, Mass.) also provide information useful in known biological methods. The following description briefly provides an overview of various recombinant polypeptide production methodologies applicable to certain embodiments of the present invention.

The polypeptides described herein are derived from transcriptional activation elements and DNA binding elements. The nucleotide sequence of many transcriptional activator elements and DNA binding elements are known. Accordingly, the known nucleic acid sequence can be used to make the polypeptides recombinantly or a nucleic acid encoding the desired polypeptide can be derived from the amino acid sequence. In certain embodiments, the transcriptional activation element of the fusion protein is selected from a group including but not limited to acidic or hydrophobic activation domains (e.g., from Gal4 or Gcn4, respectively), nine-amino-acid transactivation domains (9aaTAD, e.g., from p53, Vp16, MLL, E2A, HSF1, NF-IL6, and NF-κB), Vp64, p65, SP1, Zif268, and the trans-activation domain CJ7 derived from human Che-1/AATF. In some embodiments transcriptional activation element is derived from physiological regulators of utrophin expression, including NFAT, GABPα, and GABPβ. In some embodiments, a fusion protein has multiple transcriptional activation elements, for example, both Vp16 and CJ7.

In some embodiments, the DNA binding element of a fusion protein is a zinc finger domain, helix-turn-helix motif, leucine zipper domain, winged helix domain, winged helix turn helix domain, helix-loop-helix domain, HMG box domain, Wor3 domain, immunoglobulin domain, B3 domain, TAL effector DNA-binding domain, RNA-guided DNA-binding domain, or any DNA binding element described herein (e.g., Jazz, UtroUp, Bagly, and ZFP51 or an element having the sequence of SEQ ID NO:16-19, or having at least 50% (e.g., 50%, 55%, 60%, 70%, 72%, 75%, 80%, 81%, 85%, 90%, 95%, or 99%) identity to that of SEQ ID NO:16-19, or any of the zinc finger motifs described, e.g., in Tables 2 and 6.

Generally, this involves creating a nucleic acid sequence that encodes the polypeptide, placing the nucleic acid in an expression cassette under the control of a particular promoter, expressing the polypeptide in a host, isolating the expressed polypeptide and, if required, renaturing the polypeptide. Techniques sufficient to guide one of skill through such procedures are found in Sambrook et al., supra.

Provided with the polypeptide sequences described herein, one of skill will recognize a variety of equivalent nucleic acids that encode the polypeptide. This is because the genetic code requires that each amino acid residue in a peptide is specified by at least one triplet of nucleotides in a nucleic acid which encodes the peptide. Due to the degeneracy of the genetic code, many amino acids are equivalently coded by more than one triplet of nucleotides. For instance, the triplets CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an arginine is to be encoded by a nucleic acid triplet, the nucleic acid has any of the triplets which encode arginine. One of skill is thoroughly familiar with the genetic code and its use. An introduction to the subject is found in, for example, Chapter 15 of Watson et al., Molecular Biology, of the Gene (Fourth Edition, The Benjamin/Cummings Company, Inc., Menlo Park, Calif. (1987)), and the references cited therein.

Although any nucleic acid triplet or codon which encodes an amino acid can be used to specify the position of the amino acid in a peptide, certain codons are preferred. It is desirable to select codons for elevated expression of an encoded peptide, for example, when the peptide is purified for use as an immunogenic reagent. Codons are selected by reference to species codon bias tables, which show which codons are most typically used by the organism in which the peptide is to be expressed. The codons used frequently by an organism are translated by the more abundant tRNAs in the cells of the organism. Because the tRNAs are abundant, translation of the nucleic acid into a peptide by the cellular translation machinery is facilitated. Codon bias tables are available for most organisms. For an introduction to codon bias tables, see, e.g., Watson et al., supra.

In addition, it will be readily apparent to those of ordinary skill in the art that the fusion proteins described herein and the nucleic acid molecules encoding such fusion proteins can be subject to various changes, such as insertions, deletions, and substitutions, either conservative or non conservative, where such changes might provide for certain advantages in their use, e.g., to increase biological activity.

One of skill will appreciate that many conservative variations of nucleic acid constructs yield a functionally identical construct. For example, due to the degeneracy of the genetic code, silent substitutions (i.e., substitutions of a nucleic acid sequence which do not result in an alteration in an encoded peptide) are an acceptable feature of every nucleic acid sequence which encodes an amino acid. In addition, one of skill will recognize many ways of generating alterations in a given nucleic acid construct. Such well-known methods include site-directed mutagenesis, PCR amplification using degenerate oligonucleotides, exposure of cells containing the nucleic acid to mutagenic agents or radiation, chemical synthesis of a desired oligonucleotide (e.g., in conjunction with ligation and/or cloning to generate large nucleic acids) and other well-known techniques. See, Sambrook et al., supra.

Modifications to nucleic acids are evaluated by routine screening techniques in suitable assays for the desired characteristic. Modifications of other properties such as nucleic acid hybridization to a complementary nucleic acid, redox or thermal stability of encoded proteins, hydrophobicity, susceptibility to proteolysis, or the tendency to aggregate are all assayed according to standard techniques.

Similarly, conservative amino acid substitutions, in one or a few amino acids in an amino acid sequence of a protein are substituted with different amino acids with highly similar properties (see the definitions section, supra), are also readily identified as being highly similar to a disclosed construct.

In one preferred embodiment, the muscle-specific recombinant adeno-associated vector mAAV described above comprises, inserted in the polylinker or multiple cloning site, an exogenous gene which codes for a fusion protein comprising a DNA binding element of the zinc finger type and a transcriptional activation element. Exogenous genes which are preferred for this purpose are those which are capable of increasing utrophin expression. Among these, preference is given to the exogenous gene coding for the fusion protein known as “Vp16-Jazz” described in the Italian patent application RM2005A000493, which codes for a DNA binding domain with three zinc fingers (Jazz) fused with the strong transcriptional activation domain Vp16 of the Herpes simplex virus (the amino acid sequence of Vp16-Jazz is represented by SEQ ID NO:37). In other embodiments, genes encoding for other proteins, such as “Vp16-Bagly” or “Vp16-CJ7-UtroUp” are inserted into the multicloning site of the mAAV expression vector described above.

An exemplary mAAV expression vector of the invention including, inserted in the multiple cloning site, the sequence coding for the artificial transcription factor fusion protein “Vp16-Jazz”, is represented by SEQ ID NO:2. The main restriction sites, the regulatory region and the elements of the mAAV expression vector of SEQ ID NO:2 are shown in Table 3 below. It is to be understood that one or more of the elements contained in the vector described below can be replaced or modified. For example, different ITRs, promoter regions, splice acceptors, splice donors, epitope tags, nuclear localization sequences, etc., can be incorporated into vectors of the invention.

TABLE 3 Description of mAAV-Vp16-Jazz expression vector Position Main functional elements (bp) Main 2 NotI sites  149 restriction SEQ ID NO: 27 3658 sites GCGGCCGC 1 MluI site  155 SEQ ID NO: 28 ACGCGT ClaI 2225 SEQ ID NO: 31 ATCGAT BgIII 3023 SEQ ID NO: 32 AGATCT Regulatory Alpha-actin  156-2219 region promoter and part of the intron comprising the beta-globin gene acceptor splice donor: 1606 SEQ ID NO: 29 GCCCAGGTAGGG splice acceptor: 2154 SEQ ID NO: 30 CCCACAGCTCCT Vp16-Jazz MT-Vp16Jazz 2233-3139 Fusion Myc-Tag 2233-2428 Protein Nuclear 2452-2476 Localization Signal SEQ ID NO: 8 TGGGCCCTAAAAAGA AGCGTAAA Vp16 2476-2727 (transcriptional activation domain) Jazz (zinc 2727-3139 finger DNA binding domain) Elements Left-ITR   1-141 of the Right-ITR 3658-3799 AAV vector

In other embodiments of the invention, an mAAV expression vector can comprise a fusion protein that is different from Vp16-Jazz (e.g., fusion proteins that contain the DNA binding elements Bagly or UtroUp. Table 4 shows exemplary mAAV expression vectors including genes encoding for different fusion proteins of the invention.

TABLE 4 Exemplary mAAV-Fusion Protein Expression Vectors Fusion Protein SEQ ID (include myc- Vector Name NO: Promoter Splice Donor Splice Acceptor NLS) mAAV-Vp16- 2 Human alpha- Human Human Vp16-Jazz Jazz actin promoter alpha-actin beta-globin (SEQ ID (SEQ ID NO: 29) (SEQ ID NO: 30) NO: 66) mAAV-Vp16- 62 Human alpha- Human Human Vp16-Bagly Bagly actin promoter alpha-actin beta-globin (SEQ ID (SEQ ID NO: 29) (SEQ ID NO: 30) NO: 67) mAAV-Vp16- 63 Human alpha- Human Human Vp16-CJ7- CJ7-UtroUp actin promoter alpha-actin beta-globin UtroUp (SEQ ID NO: 29) (SEQ ID NO: 30) (SEQ ID NO: 68)

Table 5 describes exemplary fusion proteins that can be used in the present invention along with their known target DNA sequence. The general human target sequence and general mouse target sequence for the fusion proteins described in Table 5 are shown below. As shown in Table 5, the target sequences of these fusion proteins are located at different sites in the general human and mouse target sequences.

(Human target sequence) SEQ ID NO: 10 CGG-GCT-GCT-GCG-GGC-TGG-GAG (Mouse target sequence) SEQ ID NO: 11 CCG-GCT-GCT-GCG-GGC-TGG-GAG Other fusion proteins that are substantially the same, or derivatives of these examples, can also be used in the present invention. Any of the fusion proteins described in Table 5 below can be incorporated into a mAAV expression vector as described above. In some embodiments, the fusion proteins described below can include one or more additional elements, such as epitope tags or nuclear localization sequences.

TABLE 5 Exemplary Fusion Proteins and Target Sequences DNA Number Transcriptional Binding of Zinc Activation DNA target Element Fingers Element sequence Jazz 3 Vp16, CJ7, SEQ ID NO: 9 Gal4, SP1 GCT-GCT-GCG Bagly 4 Vp16 SEQ ID NO: 12 CGG-GCT-GCT-GCG (human) SEQ ID NO: 13: CCG-GCT-GCT-GCG (mouse) UtroUp 6 Vp16, CJ7 SEQ ID NO: 14  GCT-GCT-GCG- GGC-TGG-GAG

It is to be understood that the fusion proteins can consist essentially of a transcriptional activation element and a DNA binding element, for example, without other elements such as epitope tags or nuclear localization sequences. The amino acid sequences of exemplary “minimal” fusion proteins are described in, e.g., SEQ ID NO:71 (Vp16-Jazz); SEQ ID NO:72 (Vp16-Bagly); and SEQ ID NO:73 (Vp16-CJ7-UtroUp). In other embodiments, additional elements such as epitope tags or nuclear localization sequences are included in the fusion protein sequence. The amino acid sequences of exemplary fusion proteins that include epitope tags and nuclear localization sequences are described in, e.g., SEQ ID NO: 34 (myc-NLS-Vp16-Jazz); SEQ ID NO:35 (myc-NLS-Vp16-CJ7-UtroUp); and SEQ ID NO:36 (myc-NLS-Vp16-Bagly). In certain embodiments, the fusion proteins of the invention have at least 50% identity to SEQ ID NOs:71-73 or SEQ ID Nos:34-36, e.g., at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 90%, 95%, 97%, or 99% identity.

Modified Transcription Factors Capable of Increasing Utrophin Expression

In certain embodiments, a genomically-encoded (i.e., endogenous) transcription factor, for example a zinc finger transcription factor, is modified or engineered to change the target sequence that it specifically binds. In preferred embodiments, the modified transcription factor is derived from a human zinc finger transcription factor. In some embodiments, the modified transcription factor is derived from Zif268, also known as EGR-1 or EGR1. In preferred embodiments, the human zinc finger transcription factor is modified to bind to the utrophin “A” promoter, for example to a region of SEQ ID NOs:20 or 33. The DNA-binding element(s) of a human transcription factor can be modified as discussed above in relation to the DNA binding elements of fusion proteins. For example, the DNA sequences encoding for the zinc finger motifs of Zif268 can be replaced with the DNA sequences encoding for the zinc finger motifs of any of the DNA binding elements described herein, including Jazz, Bagly, or UtroUp, or with zinc finger motifs that have about having at least 70% (e.g., 72%, 75%, 80%, 81%, 85%, 90%, 95%, or 99%) identity to these zinc finger motifs. For example, the zinc finger motifs may have at least 70% identity to any of SEQ ID NOs:16-18.

In some embodiments, the DNA sequences of a human zinc finger protein, for example of Zif268, are modified so that they encode different amino acids in one or more of the zinc finger motif alpha-helix, especially at the −1, 3, and 6 positions of the zinc finger alpha-helix which are especially important in conferring binding to specific DNA targets. The methods described in the preceding section, for example, can also be used to modify the DNA binding sequence of an endogenous transcription factor, such that the endogenous transcriptional activation element (e.g., a trans-activation domain) of the modified transcription factor can induce increased utrophin expression relative to a reference.

In some embodiments, one or more of the −1, 3, and 6 positions, or residues within the zinc finger alpha helix, of a zinc finger motif of human Zif268 are modified to enable the modified transcription factor to bind the utrophin “A” promoter. In other embodiments, one or more additional amino acid residues of a zinc finger motif, preferably in the zinc finger alpha-helix that confers DNA recognition, are further modified, e.g., position 1, 2, 4, 5, 7, 8, or 9. In some embodiments zinc finger motif of a modified transcription factor of the invention (e.g., JZif1 or JZif2) may have 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more sequence identity to a known zinc finger motif, with only a few different amino acid residues, e.g., 1, 2, 3, 4, 5, 6, 7, or 8 amino acid residues modified from the genomically-encoded transcription factor (e.g. Zif268). Residues that are involved structural integrity or proper three dimensional protein folding, e.g., in zinc coordination, are disfavored for modification. In some embodiments, one or more of the zinc finger motifs of a modified transcription factor are substantially the same as compared to the genomically-encoded wild-type transcription factor, while other zinc finger motifs are modified; this can be preferred if the desired 3 bp target sequence for a given zinc finger motif is the same for both the wild-type and the corresponding modified zinc finger motif. In preferred embodiments, the zinc finger motifs of Zif268 are modified to obtain the zinc finger sequences shown in Table 6 or described in SEQ ID NOs:48-53. In some embodiments, the modified transcription factors of the invention comprise one or more zinc finger motifs that have at least 50% sequence identity to any of the zinc finger sequences shown in Table 6 or described in SEQ ID NOs:48-53, e.g., 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity. In certain embodiments, the zinc finger motifs of the modified transcription factors of the invention have less than 15 amino acid substitutions compared to any of the zinc finger motifs described in Table 6 or in SEQ ID NOs:48-53; e.g., less than 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 substitution(s).

In preferred embodiments, the modified transcription factors do not have epitope tags or other elements. For example, the JZif1 and JZif2 modified transcription factors described herein do not have an epitope tag. In the absence of an epitope tag, in some embodiments the expression of modified transcription factors is monitored using Western blot or indirect immunofluorescence assays using antibodies that recognize the genomically-encoded transcription factor from which the modified transcription factor of the invention is derived, which can reveal an augmentation or increase in protein levels compared to reference. This increase in protein levels detected by the antibody compared to reference can indicate that the modified transcription factor is being expressed.

Exemplary modified transcription factors JZif1 and JZif2 of the invention derived from human Zif268 are shown, e.g., in Table 6 and in FIG. 12. As used herein, the terms “Zif1,” “Jazz-Zif1,” “JZif1,” “ZIFJ1,” “ZIF1,” and “Ziff1” are interchangeable. As used herein, the terms “Zif2,” “Jazz-Zif2,” “JZif2,” “ZIFJ2,” “ZIF2,” and “Ziff2” are interchangeable.

TABLE 6 Comparison of Zif268 and Modified Transcription Factors JZif1 and JZif2 Zinc Finger (ZF) Motif Sequences (Amino acids in positions -1, 3 Tran- and 6 of each scrip- Number ZF motif are DNA tion of Zinc underlined and target Factor Fingers in bold). sequence Zif268 3 ZF1: CPVESCDRRFS R SD E LT SEQ ID R HIRIH (SEQ ID NO: 45) NO: 44 ZF2: CRICMRNFS R SD H LT T H GCGTGGG IRTH (SEQ ID NO: 46) CG ZF3: CDICGRKFA R SD E RK R H TKIH (SEQ ID NO: 47) JZif1 3 ZF1: CPVESCDRRFS R SD E LT SEQ ID (SEQ ID R HIRIH (SEQ ID NO: 48) NO: 9 NO: 38) ZF2: CRICMRNF S SRD V LR R H GCTGCTG NRTH (SEQ ID NO: 49) CG ZF3: CDICGRKFA S RD V LR R H NRIH (SEQ ID NO: 50) JZif2 3 ZF1: CPVESCDRRFS R SD N LV SEQ ID (SEQ ID R HIRIH (SEQ ID NO: 51) NO: 40 NO: 39) ZF2: CRICMRNFS R SD H LT T H GGC-TGG- IRTH (SEQ ID NO: 52) GAG ZF3: CDICGRKFA D PG H LV R H NRIH (SEQ ID NO: 53)

In some embodiments, the modified transcription factor has an amino acid sequence as defined by SEQ ID NO:38 (JZif1) (see FIG. 12B) or SEQ ID NO:39 (JZif2) (see FIG. 12C). The corresponding DNA sequences that encode these proteins are described in SEQ ID NO:69 (JZif1) or SEQ ID NO:70 (JZif2). The sequences coding for these modified transcription factors can be incorporated into mAAV expression vectors of the invention. For example, the sequence of a mAAV expression vector called mAAV-JZif1 in which JZif1 has been incorporated into the multi-cloning site of mAAV is described in SEQ ID NO:64, and the sequence of a mAAV expression vector called mAAV-JZif2 in which JZif2 has been incorporated into the multi-cloning site of mAAV is described in SEQ ID NO:65. Table 7 shows exemplary mAAV expression vectors including genes encoding for different modified human transcription factors of the invention. In some embodiments, the modified transcription factor has at least 50% sequence identity to any of the sequences described by SEQ ID NOs:38, 39, 69, 70, 64, or 65, e.g., 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity.

TABLE 7 Exemplary mAAV-Modified Transcription Factor Expression Vectors Vector SEQ ID Modified Human Name NO: Promoter Splice Donor Splice Acceptor Transcription Factor mAAV- 64 Human alpha- Human Human JZif1 JZif1 actin promoter alpha-actin beta-globin (SEQ ID NO: 69) (SEQ ID NO: 29) (SEQ ID NO: 30) mAAV- 65 Human alpha- Human Human JZif2 JZif2 actin promoter alpha-actin beta-globin (SEQ ID NO: 70) (SEQ ID NO: 29) (SEQ ID NO: 30)

Recombinant AAV of the invention (e.g., recombinant AAV comprising a muscle-specific promoter and a gene encoding an artificial transcription factor (e.g., a fusion protein or modified transcription factor) can be produced, for example, using the mAAV expression vectors described, e.g., in Table 4 and Table 7 using methods known in the art or as described herein. For example, recombinant AAVs of the invention can be produced by the triple-plasmid transfection method, which can include use of the mAAV expression vectors and AAV capsid expression plasmids (e.g., AAV8 or AAV6 capsid expression plasmids). As shown in Table 8, recombinant AAV can comprise, for example, polynucleotides defined by SEQ ID NOs:83-87, which comprise a muscle specific promoter and a gene encoding an artificial transcription factor. Polynucleotides defined by SEQ ID NOs:83-87 can be incorporated into recombinant AAV, for example, during viral packaging. Recombinant AAV comprising polynucleotides having the sequence defined by, e.g., SEQ ID NOs:83-87 can be administered to a subject, e.g., during treatment of a muscle defect. In some embodiments, the invention provides for a use for recombinant AAVs of the invention as a medicament. In other embodiments, the invention provides a use for recombinant AAVs of the invention in a method of treatment, e.g., for a muscle disease including DMD or BMD.

TABLE 8 Exemplary Polynucleotides Incorporated into Recombinant AAV Artificial Transcription SEQ ID NO: Promoter Factor 83 Human alpha-actin promoter Vp16-Jazz 84 Human alpha-actin promoter Vp16-Bagly 85 Human alpha-actin promoter Vp16-CJ7-UtroUp 86 Human alpha-actin promoter JZif1 87 Human alpha-actin promoter JZif2 Immunogenicity

The invention provides for methods of determining the immunogenicity of a fusion protein or a modified transcription factor of the invention. In some embodiments, the level of antibodies that bind the fusion protein or modified transcription factor (referred to herein as anti-drug antibodies) are measured, e.g., by antigen-binding tests, sandwich ELISA, bridging ELISA, or surface plasmon resonance. In some embodiments, a loss of efficacy of treatment over time or adverse immunological reactions can indicate immunogenicity of the fusion protein or modified transcription factor. Other methods for measuring immunogenicity are described, e.g., in De Groot et al., Clin. Immunol. 131:189-201, 2009.

The fusion proteins and modified transcription factors of the invention can be modified to reduce immunogenicity by any method known in the art. For example, in some embodiments a fusion protein or modified transcription factor of the invention is modified by specific deletion of human T cell epitopes or “deimmunization” by the methods disclosed in international patent applications WO 98/52976 and WO 00/34317. Briefly, the sequence of the fusion protein or modified transcription factor can be analyzed for peptides that bind to MHC Class II; these peptides represent potential T-cell epitopes (as defined in WO 98/52976 and WO 00/34317). For detection of potential T-cell epitopes, a computer modeling approach termed “peptide threading” can be applied, and in addition a database of human MHC class II binding peptides can be searched for motifs present in the fusion protein or modified transcription factor sequences, as described in international Patent Applications WO 98/52976 and WO 00/34317. These motifs bind to any of the 18 major MHC class II DR allotypes, and thus constitute potential T cell epitopes. Potential T-cell epitopes detected can be eliminated by substituting small numbers of amino acid residues in the fusion protein, or preferably, by single amino acid substitutions. Typically, conservative substitutions are made. Other methods for engineering fusion proteins with reduced immunogenicity are described, e.g., in De Groot et al., Clin. Immunol. 131: 189-201, 2009.

In some embodiments, the amino acid changes introduced into a genomically-encoded human transcription factor, e.g., Zif268, to produce a modified transcription factor of the invention are not substantially recognized by the host immune system. Without being bound by theory, this could be due to the fact that the novel zinc finger motifs are in the pool of the thousands of natural zinc finger proteins expressed in humans that vary in the zinc finger alpha-helix in order to bind different DNA targets. In some embodiments, modified human transcription factors capable of increasing utrophin expression are less immunogenic than fusion proteins containing proteins from viral or other sources, such as Vp16.

Pharmaceutical Compositions

The recombinant AAVs of the invention as detailed above are preferably assessed for contamination by conventional methods and then formulated into a pharmaceutical composition. In preferred embodiments, the pharmaceutical composition is intended for injection. Such formulation involves the use of a pharmaceutically and/or physiologically acceptable vehicle or carrier, particularly one suitable for injection, such as buffered saline or other buffers, e.g., HEPES, to maintain pH at appropriate physiological levels, and, optionally, other medicinal agents, pharmaceutical agents, stabilizing agents, buffers, carriers, adjuvants, diluents, etc. For injection, the carrier will typically be a liquid. Exemplary physiologically acceptable carriers include sterile, pyrogen-free water and sterile, pyrogen-free, phosphate buffered saline. A variety of such known carriers are provided in U.S. Pat. No. 7,629,322. In one embodiment, the carrier is an isotonic sodium chloride solution. In another embodiment, the carrier is balanced salt solution. In one embodiment, the carrier includes Tween. If the virus is to be stored long-term, it may be frozen in the presence of glycerol or Tween20.

A pharmaceutical composition of this invention can be administered parenterally, orally, nasally, rectally, topically, or buccally. They can also be administered locally or systemically. In preferred embodiments, the pharmaceutical composition is administered parentally. In other preferred embodiments, the pharmaceutical composition is administered systemically. The pharmaceutical compositions can be administered once, or multiple times, at the same or at different sites. The volume and viral titer of each injection is determined individually, as further described below, and may be the same or different from other injections performed during the course of administration. In one embodiment, a particular muscle is selected for intramuscular administration. For example, in the case of mAAV8-Vp16-Jazz and mAAV6-Vp16-Jazz, local infection can be achieved by intramuscular (i.m) injection, for example, by injection of a viral suspension into quadriceps or tibialis anterioris (TA), which can be performed on either limb.

The composition may be delivered in a volume of from about 50 μl to about 2 L, e.g., 50 μL, 100 μL, 150 μL, 200 μL, 250 μL, 500 μL, 1 mL, 2 mL, 5 mL, 10 mL, 20 mL, 25 mL, 50 mL, 100 mL, 150 mL, 200 mL, 250 mL, 300 mL, 400 mL, 500 mL, 600 mL, 700 mL, 800 mL, 900 mL, 1 L, 1.2 L, 1.3 L, 1.4 L, 1.5 L, 1.6 L, 1.7 L, 1.8 L, or 2 L, including all numbers within the range, depending on the size of the patient to be treated, the viral titer used, the route of administration, and the desired effect of the method. Other volumes may be useful as determined by a physician or other person skilled in the art. An effective concentration of a recombinant adeno-associated virus carrying a nucleic acid sequence encoding the desired fusion protein under the control of the muscle-specific promoter desirably ranges between about 10⁸ and 10¹³ vector particles per milliliter (v.p./mL), e.g., 10⁸ v.p./mL, 10⁹ v.p./mL, 10¹⁰ v.p./mL, 10¹¹ v.p./mL, 10¹² v.p./mL, or 10¹³ v.p./mL. The recombinant AAV infectious units can be measured, for example, as described in McLaughlin et al., J. Virol. 62:1963-1973, 1988. It is desirable that the lowest effective concentration of virus be utilized in order to reduce the risk of undesirable effects, such as toxicity. Still other dosages in these ranges may be selected by the attending physician, taking into account the physical state of the subject, preferably human, being treated, the age of the subject, the particular muscle defect, and the degree to which the disorder, if progressive, has developed.

In some embodiments, the invention provides a pharmaceutical composition comprising a recombinant AAV of the invention for use as a medicament. In other embodiments, the invention provides a pharmaceutical composition comprising a recombinant AAV of the invention for use in a method of treatment, e.g., treatment of a muscular defect. In one such embodiment, the method comprises administering to a subject an effective amount of the pharmaceutical composition of the recombinant AAV. In a further aspect, the invention provides for the use of a pharmaceutical composition of the invention for the manufacture of a medicament. In one embodiment, the medicament is for treatment of a muscle disease, e.g., DMD or BMD.

Increasing Utrophin Expression to Treat Muscle Defects

The present invention provides compositions and methods for increasing utrophin expression, preferably for the treatment of muscle defects, most preferably dystrophinopathies. The approach of the present invention is to increase utrophin expression by contacting the utrophin gene with fusion proteins or modified human transcription factors of the invention, thereby functionally rescuing the muscle defects caused by loss of dystrophin protein function. In preferred embodiments, contacting the utrophin “A” promoter of the utrophin gene with the fusion proteins of the invention leads to increased expression of utrophin.

The compositions and methods of the present invention can be used to increase the expression of the utrophin gene of any species that possesses the gene, e.g., mice, dogs, or primates including humans. The human utrophin gene is described, e.g., in NCBI Entrez Gene ID: 7402. The mRNA sequence of the human utrophin gene can be found, e.g., under NCBI accession number NM_007124. The sequence of the human utrophin protein can be found, e.g., under UniProtKB accession number P46939 or NCBI accession number NP_009055. The mouse utrophin gene is described, e.g., in NCBI Entrez Gene ID: 22288. The mRNA sequence of the mouse utrophin gene can be found, e.g., under NCBI accession number NM_011682. The sequence of the mouse utrophin protein can be found, e.g., under UniProt KB accession number E9Q6R7 or NCBI accession number NP_035812.

In certain embodiments, fusion proteins of the invention, e.g., those having the amino acid sequence of SEQ ID NO:34-38, specifically bind to the utrophin “A” promoter. The utrophin “A” promoter can comprise the DNA sequence of SEQ ID NO:20 (human) or SEQ ID NO:33 (mouse). The target DNA sequence bound by the fusion protein can comprise the DNA sequence of any of SEQ ID NOs 9-14, 40. Binding of the fusion proteins to the utrophin “A” promoter can subsequently lead to an increase in utrophin transcription due to the presence of the transcriptional activation element near the transcriptional start site of the utrophin gene. Alternatively, the target sequence of the fusion proteins can be in the utrophin gene itself or in other utrophin regulatory regions known in the art, e.g., the utrophin “B” promoter. In other embodiments, modified transcription factors, including modified human transcription factors of the invention, for example those having the amino acid sequence of SEQ ID NOs 38 or 39, specifically bind to the utrophin “A” promoter. The target DNA sequence bound by the modified human transcription factor can comprise the DNA sequence of SEQ ID NOs 9 or 40. Binding of the modified human transcription factor to the utrophin “A” promoter will subsequently lead to an increase in utrophin transcription due the presence of the endogenous trans-activation domain of the modified transcription factor near the transcriptional start site of the utrophin gene. Alternatively, the target sequence of the modified transcription factors can be in the utrophin gene itself or in other utrophin regulatory regions known in the art, e.g., the utrophin “B” promoter.

Methods of Treatment

The present invention provides compositions and methods for prophylaxis or treatment of muscle diseases and defects. Non-limiting examples of muscle defects include inherited diseases, such as: myopathy, dystrophy (e.g., DMD, BMD, limb-girdle, congenital, facioscapulohumeral, myotonic, oculopharyngeal, distal, or Emery-Dreifuss muscular dystrophy), myotonia, congenital myopathies (e.g., nemaline, multi/minicore, or centronuclear myopathy), mitochondrial myopathies, familial periodic paralysis, inflammatory myopathies, metabolic myopathies. Muscle defects also include acquired muscle defects, such as: external substance induced myopathy (e.g., drug-induced myopathy, glucocorticoid myopathy, alcoholic myopathy, or other myopathies caused by toxic agents), dermatomyositis, polymyositis, inclusion body myositis, myositis ossificans, rhabdomyolysis, and myoglobinurias.

In preferred embodiments, these diseases are muscular dystrophy. In one embodiment, the muscle disease is BMD. In the most preferred embodiments, the muscle disease is DMD. Generally, the methods include administering to a mammalian subject in need thereof, an effective amount of a composition comprising a recombinant AAV carrying a nucleic acid sequence encoding a fusion protein capable of increasing utrophin expression under the control of a muscle-specific promoter.

Prophylactic treatment may be administered, for example, to a subject who is not yet ill, but who is susceptible to, or otherwise at risk of, a particular biological condition, including DMD (e.g., the subject may have mutations that cause DMD but is asymptomatic or the status of mutations that cause DMD is unknown). Therapeutic treatment may be administered, for example, to a subject already suffering from DMD in order to improve or stabilize the subject's condition (e.g., a patient already presenting symptoms of DMD).

Symptoms of muscular dystrophy including DMD, which may vary from mild to severe and may depend on what part of the body is affected, the causative mutation, and the age and overall health of the affected person, include, e.g., fatigue, learning difficulties, intellectual disability, muscle weakness (e.g., in the legs, pelvis, arms, neck, or other areas of the body), difficulty with motor skills (e.g., running, hopping, or jumping), frequent falls, trouble getting up from a lying position or climbing stairs, progressive difficulty walking, breathing difficulties, heart disease, abnormal heart muscle (e.g., cardiomyopathy), congestive heart failure, irregular heart rhythm (e.g., arrhythmias), deformities of the chest or back (scoliosis), enlarged muscles of the calves, buttocks, or shoulders, pseudohypertrophy, muscle deformities, respiratory disorders (e.g., pneumonia or poor swallowing). Detecting an improvement in, or the absence of, one or more symptoms of muscular dystrophy, indicates successful treatment.

In some embodiments, as compared with an equivalent untreated reference, treatment may ameliorate a disease or disorder (e.g., Duchenne's muscular dystrophy) or a symptom of the disease or disorder, or reduce the progression, severity, or frequency of one or more symptoms of the disease or disorder by, e.g., 1%, 2%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 100% as measured by any standard technique. For measuring symptoms of Duchenne's muscular dystrophy, one may use, e.g., electromyography (EMG), genetic tests, muscle biopsy, serum Creatine Kinase (CK) levels, muscular strength tests (e.g., manual muscle testing), or range-of-motion (ROM) tests, for example, the six-minute walk test, or by other methods known to skilled practitioners. Detecting an improvement in, or the absence of, one or more symptoms of muscular dystrophy, indicates successful treatment.

Assessment of Muscle Function

The effects of the compositions and methods of the invention on muscle function of a subject can be assessed by any assay known to one of skill in the art. For example, muscle strength, endurance, or range-of-motion of any desired muscle or group of muscles can be assessed. Some assays primarily assess muscle strength or muscle endurance, while other assays assess a combination of strength, endurance, or other muscle properties. In certain embodiments, treatment with the compositions and methods of the present invention lead to an improvement in one or more parameters of muscle function in a subject. An improvement can indicate an increase in the parameter (e.g., increased muscle strength or endurance). In muscle diseases characterized by degeneration including DMD, improvement can also indicate maintenance or a reduction in the rate of decline in a parameter over time. In preferred embodiments, the muscle function of a subject having a muscle disease (e.g., muscular dystrophy) being treated with the compositions and methods of the invention is assessed by these assays. In particularly preferred embodiments, treatment with the compositions and methods of the invention lead to an improvement in multiple parameters of muscle function.

The assessment of muscle function can occur prior to the start of treatment with the methods and compositions of the invention, and/or at any time following treatment to determine the response of the subject to the treatment. Assessment of muscle function can occur at fixed periods of time following treatment, for example, every day, every week, every two weeks, every month, every 6 weeks, every 6 months, every year, or any period of time in a range spanning any of these periods of times. Alternatively, assessment can occur at a time of choosing of a skilled practitioner including a researcher or a physician. The assay used for assessing muscle function can be chosen depending upon the condition of the patient; for example, ambulatory patients can be subjected to 6-minute walk tests (6MWT), whereas other assays may be performed on wheelchair-bound or otherwise immobilized patients. Because the loss of muscle function in muscle diseases, including DMD, can occur in the context of normal childhood growth and development, which are associated with, for example, increases in stride length or muscle size, the results obtained from these assays can be compared to age-matched references.

Administration of compositions and methods of the invention can result in improved function of skeletal and/or cardiac muscle of a subject. The function of any muscle (e.g., heart, diaphragm, extensor digitorum longus, tibialis anterior, gastrocnemius, soleus, plantaris, biceps, triceps, deltoids, pectoralis major, pectoralis minor, rhomboids, trapezius, sartorius, knee flexors and extensors, elbow flexors and extensors, shoulder abductors, etc.) or group of muscles in the body (e.g., muscles of the head, neck, torso, chest, abdomen, pelvis, perineum, upper limbs, lower limbs, etc.), can be improved by treatment using the claimed compositions and methods. The movements or actions assessed in these assays may involve cooperation of several groups of muscles or the whole body.

The compositions and methods of the invention can result in an improvement in muscle strength or force generation of a muscle or group of muscles of a subject. The force generated by a muscle can be measured by any assay known in the art. The assays can in vivo, in situ, or ex vivo assays, for example ex vivo or in situ analysis of the contractile profile of a single intact limb muscle (e.g. the extensor digitorum longus for an ex vivo assay and the tibialis anterior muscle for an in situ assay), grip force analysis, downhill treadmill exercise, manual muscle testing, myometry (e.g. assessing upper and lower extremity strength using a myometer, including evaluation of knee flexors and extensors, elbow flexors and extensors, and shoulder abductors). sustained maximum voluntary contraction (MVC) assays or in any of the assays described in Hakim et al., Methods Mol. Biol. 709: 75-89, 2011; Sharma et al., Neurology 45: 306-310, 1995; and McDonald et al., Muscle Nerve 48: 3430356, 2013.

In certain embodiments, administration of compositions comprising the recombinant AAV of the invention to a subject can lead to an improvement in muscle strength or force generation of one or more muscles of a subject as assessed by the assays described above by at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, or by 3-fold, 4-fold, 5-fold, or more, compared to reference. In other embodiments, treatment maintains muscle strength within about 25% of a reference value. In other embodiments treatment reduces the rate of decrease in muscle strength that occurs over time in muscle diseases, including DMD, by at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, or by 3-fold, 4-fold, 5-fold, or more, compared to reference. In preferred embodiments, treatment of subjects having muscular dystrophy with the compositions and methods of the invention leads to an increase, maintenance of muscle strength, or reduction in the rate of decline in muscle strength or force of one or more muscles as assessed by any assay known in the art or described herein. In particularly preferred embodiments, treatment of subjects having muscular dystrophy causes an increase in muscle strength, maintenance, or a reduction in the rate of loss of muscle strength over time as measured by myometry. For example, treatment with the compositions and methods of the invention can increase the force (measured by a myometer) exerted by the knee flexors or extensors, elbow flexors or extensors, or shoulder abductors by a patient with DMD by at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, or by 3-fold, 4-fold, 5-fold, or more compared to reference.

Treatment with the compositions and methods of the invention can also improve muscle endurance of a muscle or group of muscles of a subject. Muscle endurance may be measured by assays including but not limited to treadmill exercise, 6-minute walk test (6MWT), timed function tests, (e.g., time taken to stand from a supine position, time taken to run/walk 10 m, or time taken to climb/descend 4 standard-sized stairs), by any of the tests suitable for testing muscular strength or endurance in mice including but not limited to enforced treadmill exercise, either at constant speed (e.g., any assay described in Radley-Crabb et al., Neuromuscul. Disord. 22(2):170-182, 2012) or at accelerated speed (see, e.g., Di Certo et al., Hum. Mol. Genet. 19:752-760, 2010 or Strimpakos et al., J. Cell. Phys. 229:1283-1291, 2014), voluntary wheel exercise, grip strength test, the hang wire test, the inverted grid test, and the rotarod test, or by any of the assays described in McDonald et al., Muscle Nerve 48: 343-356, 2013.

In certain embodiments, treatment with the compositions and methods of the invention increases the subject's muscle endurance by at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, or more compared to reference) over a specified amount of time. In other embodiments treatment maintains muscle endurance or reduces the rate of decrease in muscle endurance that occurs over time in muscle diseases including DMD by at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, or by 3-fold, 4-fold, 5-fold, or more compared to reference. For example, treatment with the compositions and methods of the invention can increase the distance that a subject can walk in the time period of 6 minutes as measured by the 6MWT, reduce the number of falls that occur during the 6MWT, maintain the distance a patient can walk as measured by the 6MWT over time, or reduce the rate of decrease in the distance a patient can walk as measured by the 6MWT over time. For example, prior to treatment a patient may have a 6MWT distance in the range of about 0 m to 400 m, but treatment with the compositions and methods of the invention may increase this value by 5 m, 10 m, 20 m, 30 m, 40 m, 50 m, 60 m, 70 m, 80 m, 90 m, 100 m, 200 m, 300 m, 400 m, or any distance in a range spanning these numbers over 1 week, 6 weeks, 12 weeks, 18 weeks, 24 weeks, 30 weeks, 36 weeks, 42 weeks, 48 weeks, 52 weeks, or more of treatment. In other embodiments, treatment may maintain the 6MWT distance within about 25% of the baseline value over 1 week, 6 weeks, 12 weeks, 18 weeks, 24 weeks, 30 weeks, 36 weeks, 42 weeks, 48 weeks, 52 weeks, or more of treatment. In other embodiments, treatment may reduce the rate of decline in the 6MWT distance by 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 150%, 2-fold, 3-fold, 4-fold, 5-fold, or more over 1 week, 6 weeks, 12 weeks, 18 weeks, 24 weeks, 30 weeks, 36 weeks, 42 weeks, 48 weeks, 52 weeks, or more of treatment.

The compositions and methods of the invention can also improve serum creatine kinase (CK) levels of a subject. Serum CK levels can be measured by any assay known in the art, including in a coupled enzyme reaction where the rate of NADPH formation is measured photometrically and is directly proportional to the CK activity. In one example, samples can be centrifuged for 5 min (12,000 g) at 4° C., and serum free of hemolysis can be removed. Serum CK activity can be evaluated using the CK-NAC kit (Greiner) and analyzed kinetically by a spectrophotometer (multi-label counter Victor 3; Wallac), by setting the wavelength at 340 nm and the temperature at 37° C. For subjects with DMD, the serum CK levels are typically strongly elevated compared to normal reference controls. After exercise, the serum CK levels of subjects with DMD can increase further, for example, by about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 100%, or more compared to before exercise. In certain embodiments, treatment with the compositions and methods of the invention can reduce the subject's serum CK levels, for example prior to or following exercise, by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% or more compared to reference. In some embodiments, an improvement in serum CK levels following treatment is correlated with improvements in muscle endurance and/or muscle contractile force.

Improvement in muscle function can also be determined as an alleviation of symptoms of muscle disorders. Symptoms of muscle disorders can include, e.g., fatigue, learning difficulties, intellectual disability, muscle weakness (e.g., in the legs, pelvis, arms, neck, diaphragm, heart, or other areas of the body), difficulty with motor skills (e.g., running, hopping, or jumping), frequent falls, trouble getting up from a lying position or climbing stairs, progressive difficulty walking, breathing difficulties, heart disease, abnormal heart muscle (e.g., cardiomyopathy), congestive heart failure, irregular heart rhythm (e.g., arrhythmias), deformities of the chest or back (scoliosis), enlarged muscles of the calves, buttocks, or shoulders, pseudohypertrophy, muscle deformities, respiratory disorders (e.g., pneumonia or poor swallowing). Detecting an improvement in, or the absence of, one or more symptoms of muscular dystrophy, can indicate an improvement in muscle function.

Combination Therapy

The compositions and methods of the invention can also be used in conjunction with other remedies known in the art that are used to treat muscular dystrophy or its complications, including but not limited to: corticosteroids (e.g., cortisol, hydrocortisone, prednisone, prednisolone, deflazacort, triamcinolone, methylprednisolone, dexamethasone, betamethasone, aldosterone, and fludrocortisone); β2-adrenergic agonists (e.g., albuterol, salbutamol, levosalbutamol, terbutaline, pirbuterol, procaterol, clenbuterol, metaproterenol, fenoterol, bitolterol mesylate, ritodrine, isoprenaline, salmeterol, formoterol, bambuterol, and indicaterol); immunosuppressants (e.g., cyclosporine); anti-fibrotic drugs (e.g., peginterferon, IL-10, pioglitazone, pentoxifylline, atanercept); exon-skipping drugs (e.g., antisense oligonucleotides that target exon 51, exon 45, or exon 53 including drisapersen, eteplirsen, PRO044, PRO45, PRO051, and PRO053); stop-codon skipping drugs, (e.g., gentamycin or other aminoglycoside antibiotics and Ataluren (PTC124)); synthetic anabolic steroids (e.g., oxandrolone); osteoporosis remedies (e.g., vitamin D and calcium); constipation remedies including laxatives; cardiomyopathy remedies including angiotensin-converting enzyme (ACE) inhibitors (e.g., benazepril, captopril, enalapril, fosinopril, lisinopril, moexipril, perindopril, quinapril, ramipril, and trandolapril), diuretics, beta-blockers (e.g., bisoprolol or carvedilol), anti-arrhythmic medications (e.g., amiodarone); insulin-like growth factor (IGF-1); myostatin inhibitors (e.g., follistatin, ACE-031, and neutralizing antibodies including MYO-029); drugs that increase nitric oxide levels and/or nNOS protein levels or activity (e.g., L-arginine; phosphodiesterase inhibitors including sildenafil, tadalafil, and pentoxifylline); class II histone deacetylase (HDAC) inhibitors; small molecules that increase utrophin expression (e.g., SMT C1100); nutritional supplements (e.g., glutamine, creatine monohydrate, conjugated linoleic acid, alpha-lipoic acid, and beta-hydroxy-beta-methylbutyrate); anti-histamines (e.g. fexofenadine, loratadine, phenindamine, dexchlorpheniramine, terfenadine, cetirizine, etc.); mast cell stabilizers (e.g., sodium cromoglicate, nedocromil sodium, etc., which can be in a form of aerosols, inhalations, eye drops, etc.); coenzyme Q₁₀ (also known as ubiquinone or ubidecarenone); idebenone or other synthetic derivatives of ubidecarenone, (e.g., RAXONE®/CATENA®); omega 3; resveratrol; phytosterols/stanols; anticoagulants (e.g., warfarin); and anticholinergic agents (e.g., anti-muscarinic agents (e.g., atropine, benztropine (COGENTIN®), biperiden, chlorpheniramine (CHLOR-TRIMETON), dicyclomine (dicycloverine), dimenhydrinate (DRAMAMINE)), diphenhydramine (BENADRYL®, SOMINEX™, ADVIL®, PM, etc.), doxylamine (UNISOM™), glycopyrrolate (ROBINUL®), ipratropium (ATROVENT®), orphenadrine, oxitropium (OXIVENT®), oxybutynin (DITROPAN®, DRIPTANE®, LYRINEL® XL), tolterodine (DETROL®, DETRUSITOL), tiotropium (SPIRIVA®), trihexyphenidyl, scopolamine, solifenacin, and tropicamide), and anti-nicotinic agents (e.g., ganglion blockers including bupropion (ZYBAN®, WELLBUTRIN®) and hexamethonium, cough suppressants and ganglion blockers (e.g., dextromethorphan), nondepolarizing skeletal muscular relaxants (e.g., doxacurium and tubocurarine), ganglion blockers and occasional smoking cessation aids (e.g., mecamylamine)), which can be in a form of inhalers, nebulizer solutions, tablets, and can be administered by rectal, oral, transdermal, or parenteral routes). The compositions and methods of the present invention can also be used with, for example, other gene-based therapy approaches (e.g., viral delivery of mini- or micro-dystrophin, mini-utrophin, or trans-splicing recombinant AAV vectors); gene editing, including approaches involving zinc-finger nucleases, transcription activator-like (TAL) type III effector nucleases (TALENs), meganucleases, or clustered regularly interspaced short palindromic repeats (CRISPR), or with cell-based therapies involving transplantation of various types of precursor cells, such as ex vivo-maniupulated muscle side population cells (a lineage of uncommitted cells) into muscle fibers. The compositions and methods can also be used with any of the therapies described in Miura et al., Trends. Mol. Med. 12:122-129, 2006; Jarmin et al., Expert Opin. Biol. Ther. 14:209-230, 2014; Scully et al., Expert Opin. Orphan Drugs 1:33-46, 2013; Fairclough et al., Exp. Physiol 96:1101-1113, 2011; and Blankinship et al., Mol. Therapy., 13:241-249, 2006.

The compositions and methods of the invention can also be used in conjunction with other forms of treatment including but not limited to: physical exercise (e.g. physical therapy, range-of-motion exercises); mobility aids, supports or orthotic devices, (e.g. ankle splints, knee-ankle-foot orthosis (KAFO), spinal braces, and wheelchairs); breathing assistance (e.g. ventilators); and surgical remedies (e.g. tendon surgery, scoliosis surgery, installation of a pacemaker, and cardiac transplantation). The choice of specific treatment may vary and will depend upon the severity of the pain, the subject's general health, and the judgment of the attending clinician.

EXAMPLES

The following examples are to illustrate the invention. They are not meant to limit the invention in any way.

Example 1 Materials and Methods

Construction of Artificial Zinc Finger Genes with Specific Binding to Utrophin Promoter “A”

Construction of Jazz Gene

Construction of the Jazz gene was performed as described by Corbi et al., Gene Therapy 7:1076-1083, 2000, following as model the three zinc-finger motif backbone of human transcription activation factor Zif268 (Christy et al., Proc. Natl. Acad. Sci. USA 85:7857-7861, 1988) to specifically target the 9 base pair Jazz target in the utrophin promoter “A”, with the following sequence 5′-GCTGCTGCG-3′ (SEQ ID NO:9).

Construction of Bagly Gene

The Bagly four zinc-finger motif gene was constructed as described by Onori et al., Biochem Cell Biol. 85:358-365, 2007, to specifically target the 12 base pair Bagly target in the utrophin promoter “A”, with the following sequence 5′-CGGGCTGCTGCG-3′ (SEQ ID NO:12) or 5′-CCGGCTGCTGCG-3′ (SEQ ID NO:13).

Construction of UtroUp Gene

The UtroUp six zinc-finger motif gene was constructed as described by Onori et al., BMC Molec. Biol., 14:3, 2013, to specifically target the 18 base pair UtroUp target in the utrophin promoter “A”, with the following sequence 5′-GCTGCTGCGGGCTGGGAG-3′ (SEQ ID NO:14).

Construction of Genes Encoding Fusion Proteins with Specific Binding to Utrophin Promoter “A”

Construction of the Gene Encoding Vp16-Jazz Fusion Protein

Fusion of the Jazz gene with the DNA encoding Herpes transcription factor VP16 was performed as described by Mattei et al., PLoS ONE 2:e774, 2007.

Construction of the Gene Encoding SP1-Jazz Fusion Protein

Fusion of the Jazz gene with the DNA encoding human transcription factor SP1 (Kadonaga et. al., Cell 51:1079-1090, 1987) was performed as described by Corbi et al., Gene Therapy 7:1076-1083, 2000.

Construction of the Gene Encoding Gal4-Jazz Fusion Protein

Fusion of the Jazz gene with the DNA encoding transcription activation domain of yeast Gal4 protein (Sadowski, Genetic Engineering (NY), 17:119-148, 1995) was performed as described by Corbi et al., Gene Therapy 7:1076-1083, 2000.

Construction of the Gene Encoding Vp16-Bagly Fusion Protein

Fusion of the Bagly gene with the DNA encoding the Herpes transcription activation protein VP16, was performed as described by Onori et al., Biochem Cell Biol. 85:358-365, 2007.

Construction of the Gene Encoding Vp16-CJ7-UtroUp Fusion Protein

Fusion of the UtroUp gene with the DNA encoding a 100 amino acid fragment CJ7 of the human transcription activation domain CJ7, derived from the human Che-1/AATF protein (Desantis et al., Neuromusc. Disord. 21:158-162, 2009) was performed as described by Onori et al., BMC Molec. Biol., 14:3, 2013.

Design and Construction of Modified Zif268 Genes with Specific Binding to Utrophin Promoter “A”

In order to minimize any possible immune response raised against the ZF-ATF therapeutic gene product, a novel class of ZF-ATFs was developed. In this new generation of ZF-ATFs, called “modified human transcription factors”, few amino acid modifications are hidden in the natural context of a resident human gene. To this end the well characterized Zif268/EGR1 gene (FIG. 12) was selected as a starting template. Varying only few appropriate amino acids in the Zif268 alpha-helix zinc finger motifs, two novel artificial genes named: “JZif1” and “JZif2” were produced (FIG. 12). JZif1 and JZif2 bind two DNA contiguous target sequences present in both mouse and human utrophin gene promoter “A”: 5′-GCTGCTGCG-3′ (SEQ ID NO:9) and 5′-GGCTGGGAG-3′ (SEQ ID NO: 40), respectively. In particular, JZif1 binds the same DNA target sequence recognized by Jazz, while JZif2 binds the nine base pairs adjacent to the Jazz DNA target sequence. The JZif1 and JZif2 DNA binding sites are coincident with the 18 base pair DNA target sequence recognized by UtroUp (FIG. 13). Importantly, JZif1 and JZif2 can work synergistically in activating transcription, leading to higher levels of utrophin up-regulation compared to expression of one protein alone. The few amino acid changes introduced into Zif268 to obtain JZif1 and JZif2 are difficult to be recognized by the host immune system, since the novel zinc finger domains will be in the pool of the thousands of natural zinc finger proteins that vary in the alpha-helix in order to bind different DNA targets.

Due to the rationale of using modified transcription factor proteins almost identical to the genomically-encoded transcription factor in order to minimize immunogenicity, the in vivo analysis in mdx mice or treatment of human subjects preferably does not involve the use of an epitope tag fused to the JZif1 or JZif2 proteins. To follow the expression of these proteins in the absence of an epitope tag, it is possible to detect the presence of JZif1 and JZif2 mRNAs using RT-PCR and to perform Western blot or indirect immunofluorescence assays using the commercial anti-Zif268 antibody. An increase in signal in the Western blot or indirect immunofluorescence assay relative to reference can be used to detect expression of exogenous JZif1 or JZif2.

The modified human transcription factors JZif1 and JZif2 were chemically synthesized by GENEART® Gene Synthesis service (Life Technologies) which yielded DNA fragments SEQ ID NO:81 (which contains JZif1 and additional sequences for cloning) and SEQ ID NO:82 (which contains JZif2 and additional sequences for cloning), which were inserted into the vector “mAAV” (SEQ ID NO:1) giving rise to: mAAV-JZif1 (SEQ ID NO:64) and mAAV-JZif2 (SEQ ID NO:65). These DNA fragments were also inserted into the vector “pAAV”, giving rise to pAAV-JZif1 and pAAV-JZif2.

Construction of Expression Vectors for Production of Muscle-Specific Recombinant AAV

To achieve the highest degree of muscle-specific expression of artificial fusion proteins (including fusion proteins or modified transcription factors), novel muscle recombinant AAV expression vectors were constructed and combined with the use of AAV serotypes with muscle tropism. As detailed below, the Stratagene commercial pAAV vector was modified by substituting the CMV regulatory regions with upstream regulatory regions of either human alpha-actin gene or myosin light chain gene promoters. In the case of alpha-actin, further stabilization of the transcript was enabled by replacing the human alpha-actin splice acceptor site with the beta-globin intron acceptor. These novel muscle-specific pAAV expression vectors were named “mAAV.” These vectors, (e.g., the vectors described by SEQ ID NOs:1 or 2 or SEQ ID NOs:62-65) can be used using the methods described below to produce recombinant adeno-associated vectors (e.g., viral particles that comprise nucleotide sequences that allow for muscle-specific expression of artificial transcription factors of the invention). For example, the viral particles can include a polynucleotide sequence defined by SEQ ID NO:83-87, corresponding to portions of mAAV-Vp16-Jazz, mAAV-Vp16-Bagly, mAAV-Vp16-CJ7-UtroUp, mAAV-JZif1, and mAAV-JZif2, respectively.

Construction of mAAV Expression Vector with the Alpha-Actin Promoter and Beta-Globin Splice-Acceptor

A human DNA fragment from chromosome 1 (NT_167186.1), 1542 base pairs long containing the alpha-actin enhancer, promoter, and part of the first intron was amplified (from position 23087438-23088980) using the following oligonucleotides:

SEQ ID NO: 3 5′ ACGCGTCACCAACTGGGTAACCTCTGCTGA-3′ and SEQ ID NO: 4 5 GCTAGCAAGCTTACCAGGTGAACCGACTGGGTTCTG-3′.

The PCR conditions were the following: 30 cycles at 95° C. for 15 sec, 70° C. for 30 sec, 72° C. for 2 min, and a final extension at 72° C. for 10 min. The cytomegalovirus (CMV) transcription regulatory region of pAAV-hrGFP vector (Stratagene, La Jolla, Calif.) was deleted. The 1542 base pairs long DNA fragment containing the alpha-actin enhancer, promoter, and part of the first intron of the alpha-actin gene was inserted into the pAAV-hrGFP vector deleted for the CMV regulatory region.

In order to optimize tissue-specific expression, transcription efficiency, maturation and stability of the transcript expressed by the mAAV vector, a region of the splice acceptor of the first intron of the human alpha-actin was replaced by the splice acceptor of the second intron and part of the third exon of the human beta-globin gene, as follows:

Using specific oligonucleotides, a 2329 bp DNA fragment containing the splicing regulatory regions of the first intron of alpha-actin gene was deleted of its sequence acceptor splicing at the 3′ end. The deleted region was replaced with a fragment containing the sequence acceptor of the second intron and part of the third exon of the human beta-globin gene. The main elements of this vector named “mAAV” are shown in Table 1.

Recombinant pAAV Containing the Myosin Light Chain Promoter and Vp16-Jazz:

The pAAV-MLC-Vp16-Jazz vector was obtained by cloning the pMex-Vp16-Jazz construct described by Mattei et al., PloS One 22(8):e774, 2007 into the NotI sites of the pAAV-hrGFP vector (Stratagene), which was deleted of the GFP gene and all regulatory sequences contained between the two NotI sites.

Construction of Recombinant mAAV Expression Vectors Carrying Fusion Proteins and/or Modified Transcription Factors

mAAV expression vectors to express individual fusion proteins (Vp16-Jazz, Vp16-Bagly, or Vp16-CJ7-UtroUp) or individual modified human transcription factors (JZif1 or JZif2) were constructed by cloning DNA fragments containing the DNA sequences encoding each of the proteins into the polylinker site of the mAAV vector.

Production and Purification of Recombinant mAAV8 and mAAV6 Stocks

Using mAAV expression vectors that contain the artificial transcription factors of the invention (e.g. vectors described by SEQ ID NOs:2 or SEQ ID NOs:62-65) or EGFP (SEQ ID NO:80), recombinant mAAV8 and mAAV6 were generated by the triple-plasmid transfection method “AAV Helper-Free System” (Stratagene) according to the manufacturer's instructions by using AAV8 and AAV6 capsid expression plasmids, respectively. Viral particles were purified from DMEM growth medium 72 h after the triple transfection. The transfected cells' growth medium was extensively centrifuged and viral particles were concentrated in a SPECTRA-POR®FLOAT-A-LYZER® G2 Dialysis System (Sigma-Aldrich, St. Louis, Mo.) using Slide-A-Lyzer Concentrating Solution for Dialysis (Thermo Fisher Scientific, St. Waltham, Mass.). 20:1 concentrated viral suspension was dialyzed twice against physiological solution for 5 to 6 h.

Titration of Recombinant mAAV8 and mAAV6 Preparations Using Quantitative Real-Time PCR

The titer of recombinant mAAV8 and mAAV6 viral particles present in the growth medium was assessed using a quantitative real-time PCR assay. Growth medium fractions of the triple-transfected cells containing either mAAV8 or mAAV6 viral particles were pre-treated with DNase. For DNase digestion, 5 μl of the viral suspension was incubated with 35 U of DNase I (Roche Molecular Biochemicals, Mannheim, Germany) in a final volume of 90 μl of PCR buffer (50 mM KCl, 10 mM Tris-HCl pH 8, 5 mM MgCl₂) (Roche Molecular Biochemicals) at 37° C. for 30 min. DNase I was inactivated by incubation at 70° C. for 10 min. After DNase I treatment viral suspension was incubated with 10 μg Proteinase K (Roche Molecular Biochemicals), at 50° C. for 60 min. A sample of 2.5 μl was used for qPCR. Briefly in each qPCR run a standard curve was generated using serial dilution of the vector mAAV-Jazz containing one α-actin promoter per plasmid molecule (Mayginnes et al., J. Virol. Methods 137:193-204, 2006). The standard curve was generated using the plasmid mAAV-Jazz ranging from 3×10³ to 3×10¹³ copies. Each dilution step was measured in triplicate per ABI Prism run. PCR was performed using the SYBR Green DNA Master Mix (Life Technologies Corporation, Carlsbad, Calif.). PCR products were subjected to melting curve analysis using the light cycler system to exclude the amplification of unspecific products. Primers were synthesized by Life Technologies Corporation.

The following primers were used: SEQ ID NO:21 5′-CGAGCCGAGAGTAGCAGTTGTAG-3′; SEQ ID NO:22 5-′GCTAGCTAGCAAGCTTACCAGGTGAACCGACTGGGTTCTG-3′. The single-stranded nature of the mAAV8 genome as well as the double-stranded plasmid standard curve values were taken into consideration.

Animal Care

Dystrophin-deficient C57BL/10ScSn-DMDmdx/J mice (mdx) were housed under a 12-h light-dark schedule and were fed with fat-enriched rodent chow to ameliorate the low fertility of this strain. C57BL wild type control mice were also bred and housed under a 12-h light-dark schedule and were fed with standard rodent chow. All experiments were carried out in accordance with the Directive 2010/63/EU of the European Community for the care and use of laboratory animals. Housing of the animals meets the behavioral needing of the species and was supervised by the Responsible Veterinarian.

Mice Treatments

Mice were injected with mAAV8-EGFP, mAAV8-Vp16-Jazz, mAAV8-JZif1, mAAV8-JZif2, mAAV8-Vp16-CJ7-UtroUp, or mAAV8-Vp16-Bagly viruses at 5 days of age. Systemic infection was achieved by intraperitoneal (i.p.) injection of 150 μl of viral suspension (5×10¹²v.p./ml; 75 μl on each side of the lower quadrant of the abdomen) using a 0.3-ml Accu-Fine syringe (Roche Molecular Biochemicals).

RNA Extraction, RT-PCR, and Real-Time PCR

Total RNA was isolated from excised animal tissues and reverse transcribed as previously described (Di Certo et al., Hum. Mol. Genet. 19:752-760, 2010). Real-time PCR assays were performed in a 96-well format using the ABI Prism 7000 Sequence Detection System (Applied Biosystems, Foster City, Calif.). To compare the utrophin gene expression rate the amount of target gene was normalized to that of the housekeeping gene β2-microglobulin (β2M). Primers and probes for the target gene (utrophin, UTRN) and for β2M were purchased as TaqMan Gene Expression Assays (Life Technologies Corporation). The PCR reaction was performed as previously described (Mattei et al., PLos One 2:e774, 2007). The results were analyzed using Applied Biosystems analysis software. The data are expressed as the ratio between UTRN and β2M mRNA expression. A minimal number of two mice were analyzed for each category. The expression of the fusion protein or modified transcription factor mRNA in different tissues was studied with RT-PCR using the following primers:

Primers for Jazz:

SEQ ID NO: 23 5′-GTCGCCCCCCCGACCGATGTCAGC-3′; and SEQ ID NO: 24 5′-GGGCGATCCAGGATCCCCGGGAAT-3′ Primers for Jazz, Bagly and UtroUp:

SEQ ID: 76 5′-CGAGCCGAGAGTAGCA-3′; and SEQ ID: 77 5′-CATGGTGAGGTCGCCCAAGCTCT-3′ Primers for JZif1 and JZif2:

SEQ ID: 78 5′-CGAGCCGAGAGTAGCA-3′; and SEQ ID: 79 5′-GAGCTGGAGCTATTGCTTCC-3′ Primers for β2M Expression were Used to Control Amplification:

SEQ ID NO: 25 5′-TTCTGGTGCTTGTCTCACTGA-3′ and SEQ ID NO: 26 5′-CAGTATGTTCGGCTTCCCATTC-3′. PCR Products were Analyzed by Agarose Gel Electrophoresis. Western Blot Analysis

Protein extracts were obtained as previously described (Di Certo et al., supra). Fifty micrograms of protein extracts was electrophoresed through standard 6% SDS-PAGE or NUPAGE 3-8% (Life Technologies Corporation), according to the manufacturer's instructions. The following antibodies were used: anti-myc monoclonal antibody (9E10 clone, DSHB, Iowa City, IO); anti-GFP polyclonal antibody (Agilent Technologies, Santa Clara, Calif.); anti-utrophin monoclonal antibody (Santa Cruz, Santa Cruz, Calif.); anti α-tubulin monoclonal antibody (Sigma Corporation, St. Louis, Mo.), anti-laminin polyclonal antibody (Sigma Corporation); polyclonal anti-Zif268/Egr-1 (C-19): sc-189 (Santa Cruz, Santa Cruz, Calif.); and polyclonal anti-Utrophin, Abnova A01 (Germany). Immunoreactive bands probed with horseradish peroxidase-conjugated antibodies were visualized by chemiluminescence (ECL; GE Healthcare, Little Chalfont, UK), according to the manufacturer's instructions.

Histological Analysis

Tissues (transverse and cross sections) were excised from control and mAAV-treated mice as previously reported (Di Certo et al., supra). Harvested tissue samples were fixed in either in a solution of CRYO-OCT (Fisher Scientific) or in a 4% paraformaldehyde (PFA) solution in PBS. After fixation, tissue samples were immersed in sucrose 33%, and precipitated tissue was then dried and frozen at −80° C.

Hematoxylin & eosin (Roth, Karlsruhe, Germany; H&E) staining was used for evaluation of degeneration, necrotic foci and lymphocyte infiltration, following the manufacturer's instructions. The entire cross-section, taken at mid-belly, was analyzed by microscope (Olympus BX51; Tokyo, Japan). Images were captured using a digital camera at 10× magnification.

Sections were analyzed by a pathologist using a microscope (Olympus BX51; Tokyo, Japan). Images were captured using a digital camera at 10× magnification.

Measure of Cross-sectional Area

Quantification of cross-sectional area (CSA) of single muscle fibers, in various skeletal muscles from 8-week-old control or mAAV8-Vp16-Jazz, mAAV8-EGFP, or saline-injected mdx mice, was expressed in μm². Six mice per group were analyzed and at least 200 myofibers were counted in each section; data are expressed as means±SEM.

EGFP Detection

At the age of 15 days, 2 months and 8 months, mAAV8-EGFP-injected and control mdx mice were sacrificed and perfused with 4% paraformaldehyde in PBS. Cryostatic sections from muscles and other organs were observed for direct EGFP fluorescence by conventional epifluorescence microscope (Olympus BX51). Images were captured using a digital camera at 10× magnification and merged using the IAS2000 software.

Immunohistochemistry

Skeletal muscle cross-sections were subjected to indirect immunofluorescence as previously described (Di Certo et al., supra). The following primary antibodies were used: anti-laminin polyclonal antibody (Sigma Corporation); anti-CD68 monoclonal antibody (Lifespan Biosciences, Seattle, Wash.), and anti-utrophin monoclonal antibody (BD, Lab Transduction). To visualize myc-tagged protein expression, transversal sections of 6-μm thick were treated as described (Veal et al., Int. J. Biochem. Cell Biol. 30:811-821, 1998). Briefly, slides were fixed in ice-cold acetone followed by immersion in 1% H2O2 in methanol at room temperature. Sections were blocked with 10% goat serum in PBS for 30 min and dual-stained with anti-myc monoclonal antibody (9E10 clone) and anti-laminin polyclonal antibody (Sigma Corporation), in 10% goat serum in PBS at 4° C. overnight. The following secondary antibodies were used: Alexa Fluor 594 and 488 conjugated IgG (Life Technologies Corporation). Slides were mounted with ProLong Gold antifade reagent with Dapi (Life Technologies Corporation). Stained specimens were analyzed by conventional epifluorescence microscope (Olympus BX51).

Mechanical Response of Isolated Muscles and Procion Orange Uptake

Contractile activity of muscles from mAAV8-Jazz-treated and control mdx mice was examined in vitro by physiological assessment of the muscle force on isolated extensor digitorum longus (EDL) preparations of both hind limbs and of abdominal longitudinal strips (ABD). Muscle preparations were suspended in a 20-mi bath of oxygenated Krebs solution (120 mM NaCl, 25.1 mM NaHCO₃, 2.8 mM KCl, 1.2 mM KH₂PO₄, 1.2 mM MgSO₄, 1.3 mM CaCl₂, and 5 mM D-glucose) maintained at 37° C., stretched to a tension of 1.0 g and allowed to equilibrate for 30-60 min, changing the superfusion buffer every 15-20 min. Both EDL muscles, mounted vertically with their tendons intact and ABD muscles were exposed to direct field stimulation via platinum wire electrodes using single stimulations of rectilinear pulses of 0.5 msec duration at 0.05-0.2 Hz (Electric Stimulatore Digit 3 T, Lace Elettronica, Pisa, Italy). Muscle excitability was examined by varying the voltage from 0.5 to 7 V, until the supramaximal voltage was reached. Muscle mechanical activity was recorded isotonically by a strain-gauge transducer (7006 isotonic transducer) and displayed on a recording microdynamometer (Unirecord 7050, Basile, Milano, Italy). At the end of the tension recordings, EDL and ABD muscles were subjected to a period of repetitive stimulation. Muscle contractions were elicited by trains of stimuli at a frequency of 40 Hz for 250 msec every second for 3 min. Following this procedure, able to obtain the muscle fatigue, tissues were removed from the chamber and subjected to Procion Orange staining as previously described (Di Certo et al., supra).

Assays of Mice Performances by Treadmill Running and Evans Blue Dye

Treadmill studies were performed on five-lane motorized treadmill (Treadmill Model LE8710, PanLab, Cornelia, Spain) equipped with an electronic control unit, and an electric shock grid at one end of the treadmill. Shock intensity was set at 0.2-0.4 mA. Inclination of treadmill was set at 0°. Mice were subjected to three or four sessions, separated by 2 to 7 days of rest: one treadmill session was of exercise running followed by three sessions of endurance running.

Standard Protocol:

Exercise studies were performed on a five-lane motorized treadmill equipped with an electronic control unit (Treadmill Model LE8710, PanLab, Cornelia, Barcelona, Spain), and an electric shock grid at one end of the treadmill. Shock intensity was set at 0.4 mA. Inclination of the treadmill was set at 0°. During the first session, mice were familiarized to the treadmill apparatus for 2 min followed by a running with the treadmill belt speed set at 6 m/min. Each mouse was immediately removed from the treadmill after three electric shocks. One day after habituation, mice were subjected to an endurance protocol, repeated once a week for four consecutive weeks, with the treadmill belt running at accelerated speed. Mice were first acclimated to the treadmill for 2 min, followed by a running session with the treadmill belt speed initially set at 12 m/min. At 5 min after the initiation of exercise, the speed was increased by 1 m/min every 2 min and exercise continued until exhaustion. Exhaustion was defined as inability of the mouse to maintain running speed despite repeated contact with the electric grid. The time for removal of a mouse from the treadmill was 5 sec on the shocker plate without the mouse attempting to reengage the treadmill. The time to exhaustion was automatically recorded from the beginning of the running session. After the treadmill performance some mice were injected intraperitoneally with 1% of Evans Blue dye (EBD) solution in saline buffer. Muscle specimens were taken and processed as described above.

Accelerated Protocol:

Groups of 5 mice per session were first familiarized to the treadmill for 2 min with the treadmill belt stationary, followed by acclimatization with gentle walking for 2 min at a treadmill belt speed of 4 m/min, followed immediately by an exercise session for 30 min at 12 m/min. If a mouse fatigued during the 30 min exercise session and could no longer run, the following procedure was used: first, the treadmill was turned off for 2 min to give all mice on the treadmill a rest, after which the treadmill belt speed was increased to 4 m/min for 2 min, followed by an increase in the speed to 12 m/min for the remainder of the 30 min. This process was repeated each time a mouse fatigued. Two to three days after this initial habituation, mice were subjected to an endurance running, repeated three times with two to three days of rest between endurance running sessions, with belt running at accelerated speed. Mice were first acclimated to the treadmill for 2 min with the treadmill belt stationary, followed by a running session with the treadmill belt speed initially set at 12 m/min. At 5 min after the initiation of exercise, the speed was increased by 1 m/min every 2 min and exercise continued until exhaustion. Exhaustion was defined as inability to maintain running speed despite repeated contact with the electric grid. The time for removal of mice from the treadmill was 5 s on the shocker plate without attempting to reengage the treadmill. The time to exhaustion was automatically recorded from the beginning of the running session.

Evans Blue Staining

After the treadmill performance some mice were injected intraperitoneal with 1% of Evans Blue dye (EBD) solution in saline buffer. Muscle specimens were taken and processed as described above.

In Vitro Analysis of Utrophin Up-Regulation

Cells and Cell Lines:

Human HeLa cells and NIH 3T3 mouse fibroblasts were grown in Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% fetal calf serum. Murine myogenic C2C7 cells were grown in 20% fetal calf serum (growth medium GM), up to confluence. Differentiation was induced by switching to differentiation medium (DM) consisting of DMEM containing 2% fetal calf serum (for transient transfection experiments) and 4% serum (for recombinant AAV infection experiments). Differentiated C2C7 cells were infected with mAAV8-EGFP, mAAV8-JZif1, or mAAV8-JZif2 viral particles purified according to the method described above (diluted 1:8 in DM medium) and processed after 10 days for Western blot and fluorescence microscopy.

HeLa cells and C2C7 cells were transiently co-transfected with the indicated plasmids and pXP-Luc, consisting of the luciferase gene under the control of about 318 base pairs of the human utrophin promoter A, as described in Onori et al., BMC Molec. Biol., 14:3, 2013 and Dennis et al., Nucleic Acid Research 24:1646-1652, 1996. Following transfection, HeLa cells were processed after 24 h, while C2C7 cells were cultured in GM for another 24 h, incubated in DM for 48 h, and then processed.

Myogenic conversion was induced in NIH 3T3 mouse fibroblasts and HeLa cells by co-transfection with a plasmid expressing the myogenic master gene MyoD under regulation of the cytomegalovirus (CMV) promoter. Following transfection, cells were incubated in differentiation medium (DM) consisting of DMEM containing 5% fetal calf serum for 72 h, followed by processing. Transient transfections were performed using LIPOFECTAMINE® based reagents (Invitrogen, Life Technologies) according to the manufacturer's instructions.

Immunofluorescence Microscopy of Cell Lines

For EGFP direct fluorescence microscopy, C2C7 cells were fixed with 4% paraformaldehyde for 10 min at room temperature. For indirect immunofluorescence microscopy, C2C7 cells were fixed with 4% paraformaldehyde for 10 min at room temperature, then permeabilized with 0.2% Nonidet P-40 in PBS for 10 minutes and incubated for 1 h with rabbit polyclonal antibody anti-Zif268/EGR-1 (sc-189, Santa Cruz, Calif.). The Alexa Fluor 594 conjugated IgG (Life Technologies Corporation, Carlsbad, Calif., USA) secondary antibody was used. Specimens were mounted with PROLONG® Gold antifade reagent (Life Technologies Corporation, Carlsbad, Calif.). Stained slides were processed using a conventional epifluorescence microscope (Olympus BX51; Tokyo, Japan).

Luciferase Assays

Luciferase was measured as described in Onori et al., BMC Molec. Biol., 14:3, 2013. Briefly, cell extracts were prepared and assayed for luciferase (LUC) activity according to the manufacturer's instructions (Promega, Madison, Wis., USA) using a Berthold LB9506 luminometer.

Example 2 Validation of Muscle-specific Recombinant AAV

As described in Example 1 and shown in FIG. 1A, the mAAV vector was used to produce different constructs expressing either the test EGFP gene (vector mAAV-EGFP, SEQ ID NO:80) or the ZF-ATF VP16-Jazz (referred to as mAAV-Jazz in FIGS. 1-4). The mAAV-EGFP construct was used to develop virus purification protocols and to tune tissue infection conditions (FIG. 1B). The AAV tissue specificity is determined by the capsid serotype, and recombinant AAV vectors can alter their tropism when coupled with different capsid serotypes. In the present example AAV serotype 8 (AAV8), which has been demonstrated to display high tropism for both skeletal and cardiac muscles (see, e.g., Wang et al., Nat. Biotechnol. 23:321-328, 2005; Gruntman et al., Curr. Protoc. Microbiol. 14: Unit 14D.3, 2013), was used. Efficiency, timing and tissue specificity of the mAAV8 infection in the mdx mice were initially assessed by the EGFP reporter gene expression. In FIG. 1C, fluorescence microscopy performed on muscle cryosections shows that mAAV8-EGFP expression was established in several mdx muscles 15 days after intraperitoneal injection. This expression appeared to be persistent after 2 months and up to 8 months, but it showed a decrease of intensity in the different muscles analyzed. Saline solution was injected as a control. Notably, no EGFP expression was detectable in the liver (FIG. 1C) or in other non-muscular tissues tested (data not shown). These results confirm the AAV8 efficacy in transducing muscles by intraperitoneal diffusion underlining the high muscle specificity of mAAV vectors. In FIG. 1D, the expression of mAAV8-EGFP was also evaluated by Western blot analysis of the indicated tissues. These results confirmed the high muscle-specificity of mAAV vectors.

Example 3 Muscle-specific Delivery of Fusion Protein Vp16-Jazz to Mdx Dystrophic Mice Upregulates Utrophin Levels

The expression of the artificial transcription factor fusion protein gene Vp16-Jazz delivered to mdx mice by mAAV8 infection was monitored by different approaches. As shown in FIG. 2A, the presence of Vp16-Jazz transcripts was determined by RT-PCR experiments using RNA extracted from abdominal and quadriceps muscles of mdx mice intraperitoneally injected either with mAAV8-Vp16-Jazz or saline solution (as a control). In FIG. 2B, Western blot analysis of total proteins extracted from skeletal muscles, heart, and liver of mAAV8-Vp16-Jazz-treated, mAAV8-EGFP-treated and control mice (saline injected) confirmed muscle-specific Jazz expression. Finally, immunohistochemical analysis on cryosections of quadriceps muscles, stained with anti-myc tag antibody, revealed the presence of Vp16-Jazz protein in the nuclei of infected myofibers (FIG. 2C).

To determine whether utrophin is up-regulated in the muscles infected by mAAV8-Vp16-Jazz, specimens from skeletal and cardiac muscles were analyzed by both real-time PCR and Western blot experiments. As shown in FIG. 2D, in tissues infected with mAAV8-Vp16-Jazz, the utrophin mRNA expression level was clearly up-regulated (up to two-fold induction). The Western blot presented in FIG. 2E shows that utrophin protein levels was up-regulated (i.e., increased) in the muscles infected with mAAV8-Vp16-Jazz. These data demonstrate that the delivery of mAAV8-Vp16-Jazz by intraperitoneal injection in mdx mice is effective and adequate to re-program utrophin expression in muscle.

Example 4 Recombinant AAV Delivery of Vp16-Jazz Improves the Histopathology of Mdx Muscle Tissue

The possible therapeutic effects of utrophin up-regulation in the muscle of mdx mice treated with mAAV8-Vp16-Jazz were evaluated by typical DMD diagnostic parameters. Analysis of the slices derived from mAAV8-Vp16-Jazz muscles shows substantial morphology/architecture amelioration in comparison with mAAV8-EGFP and control saline-injected mdx mice (FIG. 3A). The frequency of centrally nucleated myofibers (CNFs), the fiber cross-sectional area (CSA), and the extent of mononuclear inflammatory cell infiltration was also quantified in muscle slices stained with hematoxylin and eosin (H&E), and the results further demonstrated reduced degenerative/regenerative processes in mAAV8-Vp16-Jazz muscles compared to controls (FIG. 3B). Next, as shown in FIG. 3C, a reduction of the muscular inflammatory response was observed in mAAV8-Vp16-Jazz infected mdx mice compared to controls by immunostaining the dystrophic muscle with the macrophage marker CD68. These data demonstrate that delivery of mAAV8-Vp16-Jazz ameliorates the dystrophic histopathology of mdx muscle tissue.

Example 5 Recombinant AAV Delivery of Vp16-Jazz to Mdx Mice Significantly Ameliorates their Dystrophic Phenotype

Dystrophin-deficient muscles are characterized by a severe deficit in contractile force and marked susceptibility to contraction-induced injury (Di Certo et al., supra). To verify whether mAAV8-Vp16-Jazz infection improves the mechanical responses in dystrophic muscle, the contractile activity of muscles from 2-month-old mAAV8-Vp16-Jazz infected and control mdx mice was measured. Isolated abdominal muscle strips and EDL were subjected to ex vivo physiological assessment of muscle force using variable voltages until the supramaximal value was reached. As shown in FIG. 4A, muscles from mAAV8-Vp16-Jazz-treated mdx mice showed a significant increase in strength compared with muscle preparations from control mdx mice. At the end of the force test, contraction-induced injury of the sarcolemma was assessed by staining each muscle with Procion Orange dye. Uptake of this fluorescent dye into individual fibers is an index of membrane integrity loss. As shown in FIG. 4B-left, fluorescence microscopy of both abdominal and EDL muscles cross-sections showed extensive uptake of the Procion Orange dye into non-treated mdx mice compared with mAAV8-Vp16-Jazz injected mdx mice. Quantification of the percentage of dye-positive area in each section confirmed the increased ability of Jazz-expressing muscles to exclude the dye from stressed fibers (FIG. 4B, right). Altogether, these data provide physiological evidence for recovery in both contractile force and sarcolemmal integrity in mdx mice infected with mAAV8-Vp16-Jazz.

In addition to the in vitro testing, the effects of the experimental gene therapy was assessed with mAAV8-Vp16-Jazz on the overall strength of dystrophic muscles in vivo. At 3 months of age mAAV8-Vp16-Jazz mdx-treated mice and control mAAV8-EGFP mdx-treated mice or mdx non-treated mice were subjected to forced physical exercise on an accelerating treadmill. The exercise was repeated once a week for four consecutive weeks and the running time was recorded in each session. As shown in FIG. 4C, mAAV8-Vp16-Jazz mdx-treated mice scored, over the four performances, a cumulative running time of approximately 110 min before reaching exhaustion, compared to the 70 min mean performance of mdx untreated/mAAV8-EGFP-treated mice. Thus, the up-regulation of utrophin achieved by the mAAV8-mediated delivery of ZF-ATF Vp16-Jazz effectively counteracts the symptoms of dystrophic pathology, resulting in an enhanced endurance performance. Furthermore, as already demonstrated in vitro with Procion Orange dye, Vp16-Jazz-mediated utrophin over-expression preserves in vivo sarcolemmal integrity during exercise, as shown by the reduced uptake of Evan's blue dye systemically injected in mice immediately after the end of the treadmill experiments (FIG. 4D). These results demonstrate the remarkable functional recovery of overall muscle strength in mdx mice expressing the artificial transcription factor Vp16-Jazz systemically administered by the mAAV8 vector and provide important insight for the possible future clinical use of ZF-ATFs to up-regulate utrophin expression for DMD gene therapy.

Example 6 Fusion Proteins Vp16-Jazz, Vp16-Bagly, and Vp16-CJ7 UtroUp and Human Modified Transcription Factors JZif1 and JZif2 can Bind to Utrophin Promoter and Increase Downstream Luciferase Reporter Gene Expression

To investigate the ability of JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly and Vp16-Jazz to upregulate (i.e., increase) expression from the utrophin promoter “A”, their ability to drive expression of a luciferase reporter gene placed downstream of a utrophin promoter “A” fragment was tested.

To test the ability of these artificial transcription factors to increase expression from the utrophin promoter “A” in human cells, HeLa cells were co-transfected with a plasmid carrying JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly or Vp16-Jazz under control of the cytomegalovirus (CMV) promoter together with the luciferase reporter construct plasmid pXP-Luc (pXP). JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly and Vp16-Jazz were each able to increase luciferase expression relative to controls (FIG. 7). In this assay, JZif1 was identified as the best-performing artificial transcription factor, as co-transfection with a plasmid carrying JZif led to a greater than 20-fold induction of luciferase expression compared to cells transfected with pXP alone (FIG. 7). The modified human transcription factor JZif2, and the fusion proteins Vp16-CJ7-UtroUp, Vp16-Bagly, and Vp16-Jazz were also significantly effective in increasing luciferase expression (FIG. 7). By comparison, no increase in luciferase expression was detected when the pXP plasmid was co-transfected with a plasmid carrying the zinc finger domain of Jazz without a transcriptional activation domain, with a plasmid carrying the genomically-encoded human transcription factor Zif268's transcription activation domain without its zinc finger motifs, or with a plasmid carrying the transcriptional activation domain of Vp16 without a zinc finger DNA binding domain, which served as controls (FIG. 7). Likewise, there was no induction of luciferase activity when a control plasmid carrying the luciferase reporter gene under control of SV40 promoter (pGL2promoter) was used for co-transfection with the JZif1 plasmid (FIG. 7), confirming the specificity of the binding of JZif1 to the utrophin promoter “A”. These results indicate that JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly, and Vp16-Jazz can each increase expression from the utrophin promoter “A” in human cells.

Similar results were obtained in luciferase reporter assays performed in mouse cells. Murine differentiated myogenic C2C7 cells were co-transfected with pXP and plasmids carrying JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly and Vp16-Jazz under the regulation of the muscle-specific alpha-actin promoter. Each of the artificial transcription factors tested showed a significant up-regulation of luciferase expression, ranging from 2.5-fold to 5-fold, compared to the control cells transfected with pXP alone (FIG. 8). As another control, C2C7 cells transfected with a plasmid carrying the genomically-encoded wild type transcription factor Zif268 did not cause a significant increase of luciferase expression, confirming that Zif268 does not specifically bind the utrophin promoter “A”. These results indicate that JZif1, JZif2, Vp16-CJ7-UtroUp, Vp16-Bagly, and Vp16-Jazz can each increase expression from the utrophin promoter “A” in mouse cells.

In addition to the alpha-actin promoter, other muscle-specific promoters, such as the myosin light chain promoter, can drive muscle-specific expression of an artificial transcription factor. This is evidenced by the significant increase in luciferase expression measured in C2C7 myogenic cells that were co-transfected with pXP and a plasmid carrying Vp16-Jazz under the regulation of either the alpha-actin promoter (mAAV-Vp16-Jazz) or the myosin light chain (MLC) promoter (AAV-pMLC-Vp16-Jazz) (FIG. 9). In this assay, the alpha-actin promoter led to stronger induction of luciferase expression compared to the MLC promoter (FIG. 9).

Example 7 Up-regulation of Utrophin Protein Expression by Modified Human Transcription Factors JZif1 and JZif2

To confirm the expression of JZif1 and JZif2 in muscle cells after recombinant AAV infection, murine differentiated myogenic C2C7 cells were infected with mAAV8-JZif1, mAAV8-JZif2, mAAV8-JZif1 and mAAV8-JZif2, or mAAV8-EGFP (control) and subjected to Western blot analysis using the rabbit anti-Zif268 polyclonal antibody. Cells infected with mAAV8-JZif1, mAAV8-JZif2, or both, showed a significantly elevated signal compared to both un-infected controls and cells infected with mAAV8-EGFP, clearly indicating the presence of JZif1 or JZif2 over the signal of the endogenous Zif268 (FIG. 5A). Expression of EGFP or JZif1 after recombinant AAV infection was confirmed using immunofluorescence microscopy analysis of murine differentiated myogenic C2C7 cells that were infected with mAAV8-EGFP or mAAV8-JZif1, respectively (FIG. 5B).

To test whether JZif1 and JZif2 can increase utrophin protein levels, HeLa and 3T3 cells that were induced to myogenic conversion by co-transfection of MyoD were transfected with plasmids carrying JZif1 or JZif2 under regulation of the alpha-actin promoter. Both HeLa (FIG. 5C) and 3T3 (FIG. 5D) cells transfected with plasmids carrying JZif1 or JZif2 showed an increase in utrophin protein levels compared to controls.

To test whether recombinant AAV infection using mAAV8-JZif1 or mAAV8-JZif2 can cause an increase in utrophin protein levels in heart muscle, Western blot analysis of utrophin protein levels was performed on lysates of heart muscles that were isolated from 2-month old mAAV8-EGFP, mAAV8-JZif1, or mAAV8-JZif2-infected mdx mice. Detection of laminin was used to normalize the amount of proteins. Utrophin protein levels were elevated in heart muscles that were isolated from mAAV8-JZif1 or mAAV8-JZif2-infected mdx mice relative to mAAV8-EGFP-infected controls (FIG. 5E).

To test whether recombinant AAV infection using mAAV8-JZif1 or mAAV8-JZif2 can cause an increase in utrophin protein levels in skeletal muscle, Western blot analysis of utrophin protein levels was performed on lysates of diaphragm muscles that were isolated from 6-week old mdx mice that were treated with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2 viral vectors, relative to utrophin levels in diaphragm muscles isolated from untreated mdx mice. Utrophin protein expression was assessed by Western blot using the mouse polyclonal anti-utrophin antibody A01 (Abnova). Treatment of mdx mice with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2 each led to increased utrophin protein levels in the diaphragm muscles compared to untreated controls (FIG. 14).

Example 8 Recombinant AAV Delivery of Vp16-Jazz, Vp16-Bagly, Vp16-CJ7-UtroUp, and Human Modified Transcription Factors JZif1 and JZif2 to Mdx Mice Ameliorates their Dystrophic Phenotype, as Measured by Muscle Function

Dystrophin-deficient muscles are characterized by a severe deficit in contractile force and marked susceptibility to contraction-induced injury (Di Certo et al., supra). To test whether mAAV8-Vp16-Jazz, mAAV8-Vp16-Bagly, mAAV8-Vp16-CJ7-UtroUp, mAAV8-JZif1, or mAAV8-JZif2 infection improves the mechanical responses in dystrophic muscle, the contractile force (tension) of muscles from 6 week-old infected and control mdx mice, as well as that of wild type mice was measured. Isolated abdominal muscle strips and EDL were subjected to ex vivo physiological assessment of muscle force using variable voltages until the supramaximal value was reached. As shown in FIGS. 10A-10B, EDL and abdominal muscles from mdx mice treated with mAAV8-JZif1, mAAV8-Vp16-Bagly, mAAV8-Vp16-CJ7-UtroUp showed a statistically significant increase in contractile force (strength) compared with muscle preparations from control mdx mice (p<0.05 as determined by Student's t test over a wide range of voltages). Altogether, these data provide physiological evidence for recovery in both contractile force and sarcolemmal integrity in mdx mice infected with mAAV8-JZif1, mAAV8-Vp16-Bagly, mAAV8-Vp16-CJ7-UtroUp.

In addition to the ex vivo muscle contractile force testing, the effects of the experimental gene therapy on the overall strength and endurance of dystrophic muscles was tested in vivo. At 6 weeks of age, mdx mice treated with mAAV8-Vp16-Jazz, mAAV8-Vp16-Bagly, mAAV8-Vp16-CJ7-UtroUp, mAAV8-JZif1, or mAAV8-JZif2; untreated mdx mice; and wild type C57BL mice were subjected to forced physical exercise on an accelerating treadmill. The exercise was performed via the Accelerated Protocol (see Example 1). As shown in FIG. 11, mdx mice treated with each of the artificial transcription factors had significantly increased mean cumulative running time compared to untreated mdx controls, in some instances having mean cumulative running times that were comparable to wild type mice. For example, mAAV8-Vp16-Jazz-treated mdx mice scored a mean cumulative running time over the 3 running performances of approximately 95 min, while mAAV8-JZif1-treated mdx mice scored a cumulative running time of approximately 85 min, which were both significantly increased (p<0.01) compared to the approximately 55 min cumulative running time of untreated mdx mice, and comparable to the approximately 97 min mean cumulative running time of the wild type mice. mAAV8-JZif2-treated mdx mice had a cumulative running time of approximately 69 min, which was also increased compared to untreated mdx mice, although lower than that obtained by treatment with mAAV8-Vp16-Jazz or mAAV8-JZif1. mAAV8-Vp16-CJ7-UtroUp- and mAAV8-VpP16-Bagly-treated mdx mice performed very well, having cumulative running times of approximately 114 min and 104 min, respectively, which was statistically significant compared to the mdx mice (p<0.001), but not significantly different from the wild type (WT) C57BL mice.

In parallel, the uptake of Evan's Blue (EBD) was evaluated to compare skeletal muscle membrane integrity after accelerated treadmill exercise in the wild type, untreated mdx mice, and mdx mice treated with the mAAV viral particles carrying the specified artificial transcription factor. As shown in FIG. 16, mdx mice treated with mAAV8-Vp16-Jazz, mAAV8-JZif1, or mAAV8-JZif2 show significantly less EBD uptake, indicating improved membrane stability.

Thus, the up-regulation (i.e., increased expression) of utrophin achieved by the mAAV8 recombinant AAVs that comprise the genes encoding the fusion proteins and modified transcription factors described herein, as shown by Western blots, RT-PCR, and immunofluorescence analyses (FIGS. 6, 14, and 15A-B) counteracts the symptoms of dystrophic pathology, resulting in increased muscle contractile force and enhanced endurance (FIGS. 10A-10B, 11). These results demonstrate the remarkable functional recovery of overall muscle strength in mdx mice expressing the fusion proteins Vp16-Jazz, Vp16-Bagly, Vp16-CJ7-UtroUp, and the modified human transcription factors JZif1 and JZif2 systemically administered by the mAAV8 recombinant AAV, and indicate that treatment with the artificial transcription factors described herein can be used to treat muscle diseases, for example DMD and BMD.

Sequences

The following sequences are used throughout the application.

SEQ ID NO: Sequence Description  1 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC vector GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA (complete CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG sequence) TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATGGGAATTCCGGGATCCGGTCG ACCGTACGTACAAGATCTACGGGTGGCATCCCTGTGACCCCTCCCCAGTGC CTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGTCCTAAT AAAATTAAGTTGCATCATTTTGTCTGACTAGTACGGGTGGCATCCCTGTGAC CCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACC AGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTT CTATAATATTATGGGGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGG GAAGACAACCTGTAGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGC AGTGGCACAATCTTGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATT CTCCTGCCTCAGCCTCCCGAGTTGTTGGGATTCCAGGCATGCATGACCAGG CTCAGCTAATTTTTGTTTTTTTGGTAGAGACGGGGTTTCACCATATTGGCCAG GCTGGTCTCCAACTCCTAATCTCAGGTGATCTACCCACCTTGGCCTCCCAAA TTGCTGGGATTACAGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTT TGTAGGTAACCACGTGCGGACCGAGCGGCCGCAGGAACCCCTAGTGATGG AGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGA CCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGA GCGAGCGCGCAGCTGCCTGCAGGGGCGCCTGATGCGGTATTTTCTCCTTAC GCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGC CCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTG ACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTT CCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGC TCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACT TGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTT CGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAA CTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATT TTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAA CGCGAATTTTAACAAAATATTAACGTTTACAATTTTATGGTGCACTCTCAGTA CAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACAC CCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGAC AAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCAT CACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGG TTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGA AATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTAT CCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAA GAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCAT TTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGC TGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAG CGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGC ACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGC AAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTA CTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTA TGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGA CAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGG ATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACC AAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCG CAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAG ACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTC CGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTC GCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAG TTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGAT CGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTT TACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCT AGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTT TCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAG ATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTA CCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGG TAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCC GTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTA CCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGC TGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACAC CGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGA AGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAG AGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTG TCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGG GGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCT GGCCTTTTGCTGGCCTTTTGCTCACATGT  2 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV- CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC Vp16-Jazz GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA vector CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG (complete TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA sequence) TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAG GACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAA TGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACCTCACCATG GGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCAGCCTGGG GGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCATGCCGACG CGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCCCCGGGTC CGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGATATGGCCG ACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACGAGTACGG TGGGGAATTCCCGGGGATCCTCGATGACAGACCCTATGCTTGCCCAGTGGA AAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCTTACCCGCCATATCCG CATCCACACCGGCCAAAAACCCTTTCAATGCCGTATCTGCATGAGGAATTTC AGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACCCACACAGGGGAAAA GCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCAAGCCGCGATGTCCT GAGGCGCCATAACAGGATACATTTGAGGCAAAATGATCTCGACCGTACGTAC AAGATCCGTTAGCATATGCTAACAGATCCACGGGTGGCATCCCTGTGACCC CTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAG CCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGTACGGGTGGC ATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTC CAGTGCCCACCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGAC TAGGTGTCCTTCTATAATATTATGGGGTGGAGGGGGGTGGTATGGAGCAAG GGGCAAGTTGGGAAGACAACCTGTAGGGCCTGCGGGGTCTATTGGGAACC AAGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAATCTCCGCCTCCTG GGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTTGTTGGGATTCCAGGC ATGCATGACCAGGCTCAGCTAATTTTTGTTTTTTTGGTAGAGACGGGGTTTCA CCATATTGGCCAGGCTGGTCTCCAACTCCTAATCTCAGGTGATCTACCCACC TTGGCCTCCCAAATTGCTGGGATTACAGGCGTGAACCACTGCTCCCTTCCCT GTCCTTCTGATTTTGTAGGTAACCACGTGCGGACCGAGCGGCCGCAGGAAC CCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTG AGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGC CTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGGGGCGCCTGATGCGGT ATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGTCAAAGCAAC CATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTT ACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTC GCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTC TAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGA CCCCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGA TAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGAC TCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGGCTATTCTTTTGAT TTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTA ACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTTATGGTG CACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACAC CCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATC CGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTT TTCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCT ATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCA CTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACAT TCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATA TTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCT TTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAA GTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTG GATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTC CAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATT GACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGAC TTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAG TAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAA CTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCAC AACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAAT GAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCA ACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGC AACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCG CTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGA GCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTC CCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACG AAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTG TCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAAT TTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCT TAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAG GATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAA AAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACT CTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCC TTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCC TACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGAT AAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCG CAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCG AACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGC CACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGG TCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTAT CTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGT GATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCC TTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT  3 ACGCGTCACCAACTGGGTAACCTCTGCTGA Oligo  4 GCTAGCAAGCTTACCAGGTGAACCGACTGGGTTCTG Oligo  5 CGATGGGAATTCCGGGATCCGGTCGACCGTACGTACAA Oligo  6 GATCTTGTACGTACGGTCGACCGGATCCCGGAATTCCCAT Oligo  7 ATCGATGGGAATTCCGGGATCCGGTCGACCGTACGTACAAGATCT Polylinker  8 TGGGCCCTAAAAAGAAGCGTAAA Nuclear localization sequence DNA  9 GCTGCTGCG Jazz, JZif1 target seq 10 CGGGCTGCTGCGGGCTGGGAG human target sequence for all of Jazz, Bagly, and UtroUp 11 CCGGCTGCTGCGGGCTGGGAG mouse target sequence for all of Jazz, Bagly, and UtroUp 12 CGGGCTGCTGCG human target sequence for Bagly 13 CCGGCTGCTGCG mouse target sequence for Bagly 14 GCTGCTGCGGGCTGGGAG target sequence for UtroUp 15 GGAGCCGGA Target seq for Zip51; described in background 16 CPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGE Zinc finger KPFACDICGRKFASRDVLRRHNRIH region Jazz 17 CPVESCDRRFSRSDNLVRHIRIHTGQKPFQCRICMRNFSRSDHLTTHNRTHTGE Zinc finger KPFACDICGRKFADPGHLVRHNRIHTGEKPFACPVESCDRRFSRSDELTRHIRIH region TGQKPFQCRICMRNFSSRDVLRRHNRTHTGEKPFACDICGRKFASRDVLRRHN UtroUp RIH 18 CPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGE Zinc Finger KPFACDICGRKFASRDVLRRHNRIHLRQGPRSHVCAECGKAFVESSKLKRHQLV region Bagly H 19 GCGCAGAACTTGGGGAGCCGCCGCCGCCATCCGCCGCCGCAGCCAGCTTC Zif268 CGCCGCCGCAGGACCGGCCCCTGCCCCAGCCTCCGCAGCCGCGGCGCGT mRNA CCACGCCCGCCCGCGCCCAGGGCGAGTCGGGGTCGCCGCCTGCACGCTT sequence CTCAGTGTTCCCCGCGCCCCGCATGTAACCCGGCCAGGCCCCCGCAACTGT GTCCCCTGCAGCTCCAGCCCCGGGCTGCACCCCCCCGCCCCGACACCAGC TCTCCAGCCTGCTCGTCCAGGATGGCCGCGGCCAAGGCCGAGATGCAGCT GATGTCCCCGCTGCAGATCTCTGACCCGTTCGGATCCTTTCCTCACTCGCC CACCATGGACAACTACCCTAAGCTGGAGGAGATGATGCTGCTGAGCAACGG GGCTCCCCAGTTCCTCGGCGCCGCCGGGGCCCCAGAGGGCAGCGGCAGC AACAGCAGCAGCAGCAGCAGCGGGGGCGGTGGAGGCGGCGGGGGCGGCA GCAACAGCAGCAGCAGCAGCAGCACCTTCAACCCTCAGGCGGACACGGGC GAGCAGCCCTACGAGCACCTGACCGCAGAGTCTTTTCCTGACATCTCTCTGA ACAACGAGAAGGTGCTGGTGGAGACCAGTTACCCCAGCCAAACCACTCGAC TGCCCCCCATCACCTATACTGGCCGCTTTTCCCTGGAGCCTGCACCCAACA GTGGCAACACCTTGTGGCCCGAGCCCCTCTTCAGCTTGGTCAGTGGCCTAG TGAGCATGACCAACCCACCGGCCTCCTCGTCCTCAGCACCATCTCCAGCGG CCTCCTCCGCCTCCGCCTCCCAGAGCCCACCCCTGAGCTGCGCAGTGCCAT CCAACGACAGCAGTCCCATTTACTCAGCGGCACCCACCTTCCCCACGCCGA ACACTGACATTTTCCCTGAGCCACAAAGCCAGGCCTTCCCGGGCTCGGCAG GGACAGCGCTCCAGTACCCGCCTCCTGCCTACCCTGCCGCCAAGGGTGGC TTCCAGGTTCCCATGATCCCCGACTACCTGTTTCCACAGCAGCAGGGGGAT CTGGGCCTGGGCACCCCAGACCAGAAGCCCTTCCAGGGCCTGGAGAGCCG CACCCAGCAGCCTTCGCTAACCCCTCTGTCTACTATTAAGGCCTTTGCCACT CAGTCGGGCTCCCAGGACCTGAAGGCCCTCAATACCAGCTACCAGTCCCAG CTCATCAAACCCAGCCGCATGCGCAAGTACCCCAACCGGCCCAGCAAGACG CCCCCCCACGAACGCCCTTACGCTTGCCCAGTGGAGTCCTGTGATCGCCGC TTCTCCCGCTCCGACGAGCTCACCCGCCACATCCGCATCCACACAGGCCAG AAGCCCTTCCAGTGCCGCATCTGCATGCGCAACTTCAGCCGCAGCGACCAC CTCACCACCCACATCCGCACCCACACAGGCGAAAAGCCCTTCGCCTGCGAC ATCTGTGGAAGAAAGTTTGCCAGGAGCGATGAACGCAAGAGGCATACCAAG ATCCACTTGCGGCAGAAGGACAAGAAAGCAGACAAAAGTGTTGTGGCCTCT TCGGCCACCTCCTCTCTCTCTTCCTACCCGTCCCCGGTTGCTACCTCTTACC CGTCCCCGGTTACTACCTCTTATCCATCCCCGGCCACCACCTCATACCCATC CCCTGTGCCCACCTCCTTCTCCTCTCCCGGCTCCTCGACCTACCCATCCCCT GTGCACAGTGGCTTCCCCTCCCCGTCGGTGGCCACCACGTACTCCTCTGTT CCCCCTGCTTTCCCGGCCCAGGTCAGCAGCTTCCCTTCCTCAGCTGTCACC AACTCCTTCAGCGCCTCCACAGGGCTTTCGGACATGACAGCAACCTTTTCTC CCAGGACAATTGAAATTTGCTAAAGGGAAAGGGGAAAGAAAGGGAAAAGGG AGAAAAAGAAACACAAGAGACTTAAAGGACAGGAGGAGGAGATGGCCATAG GAGAGGAGGGTTCCTCTTAGGTCAGATGGAGGTTCTCAGAGCCAAGTCCTC CCTCTCTACTGGAGTGGAAGGTCTATTGGCCAACAATCCTTTCTGCCCACTT CCCCTTCCCCAATTACTATTCCCTTTGACTTCAGCTGCCTGAAACAGCCATGT CCAAGTTCTTCACCTCTATCCAAAGAACTTGATTTGCATGGATTTTGGATAAA TCATTTCAGTATCATCTCCATCATATGCCTGACCCCTTGCTCCCTTCAATGCT AGAAAATCGAGTTGGCAAAATGGGGTTTGGGCCCCTCAGAGCCCTGCCCTG CACCCTTGTACAGTGTCTGTGCCATGGATTTCGTTTTTCTTGGGGTACTCTTG ATGTGAAGATAATTTGCATATTCTATTGTATTATTTGGAGTTAGGTCCTCACTT GGGGGAAAAAAAAAAAAGAAAAGCCAAGCAAACCAATGGTGATCCTCTATTT TGTGATGATGCTGTGACAATAAGTTTGAACCTTTTTTTTTGAAACAGCAGTCC CAGTATTCTCAGAGCATGTGTCAGAGTGTTGTTCCGTTAACCTTTTTGTAAAT ACTGCTTGACCGTACTCTCACATGTGGCAAAATATGGTTTGGTTTTTCTTTTT TTTTTTTTTTGAAAGTGTTTTTTCTTCGTCCTTTTGGTTTAAAAAGTTTCACGTC TTGGTGCCTTTTGTGTGATGCGCCTTGCTGATGGCTTGACATGTGCAATTGT GAGGGACATGCTCACCTCTAGCCTTAAGGGGGGCAGGGAGTGATGATTTGG GGGAGGCTTTGGGAGCAAAATAAGGAAGAGGGCTGAGCTGAGCTTCGGTTC TCCAGAATGTAAGAAAACAAAATCTAAAACAAAATCTGAACTCTCAAAAGTCT ATTTTTTTAACTGAAAATGTAAATTTATAAATATATTCAGGAGTTGGAATGTTG TAGTTACCTACTGAGTAGGCGGCGATTTTTGTATGTTATGAACATGCAGTTCA TTATTTTGTGGTTCTATTTTACTTTGTACTTGTGTTTGCTTAAACAAAGTGACT GTTTGGCTTATAAACACATTGAATGCGCTTTATTGCCCATGGGATATGTGGT GTATATCCTTCCAAAAAATTAAAACGAAAATAAAGTAGCTGCGATTGGG 20 AGAGACCTGTTTTGCCTAAGGGGACGTGACTCACATTTTCGGATAATCTGAA Entire TAAGGGGAATTGTGTCTGCTCGAGGCATCCATTCTGGTTCGGTCTCCGGACT Utrophin CCCGGCTCCCGGCACGCACGGTTCACTCTGGAGCGCGCGCCCCAGGCCAG promoter “A” CCAAGCGCCGAGCCGGGCTGCTGCGGGCTGGGAGGGCGCGCAGGGCCGG human DNA CGCTGATTGACGGGGCGCGCAGTCAGGTGACTTGGGGCGCCAAGTTCCCG sequence ACGCGGTGGCCGCGGTGACCGCCGAGGCCCGGCAGACGCTGACCCGGGA ACGTAGTGGGGCTGATCTTCCGGAACAAAGTTGCTGGGCCGGCGGCGGCG GGGCGAGAGCGCCGAG 21 CGAGCCGAGAGTAGCAGTTGTAG Oligo for qPCR 22 GCTAGCTAGCAAGCTTACCAGGTGAACCGACTGGGTTCTG Oligo for qPCR 23 GTCGCCCCCCCGACCGATGTCAGC Oligo 24 TTCTGGTGCTTGTCTCACTGA Oligo 25 TTCTGGTGCTTGTCTCACTGA RT PCR oligo 26 CAGTATGTTCGGCTTCCCATTC RT PCR oligo 27 GCGGCCGC NotI 28 ACGCGT MluI 29 GCCCAGGTAGGG Splice donor 30 CCCACAGCTCCT Splice acceptor 31 ATCGAT ClaI 32 AGATCT BglIII 33 AGAGACCTAGTGTGCCTAGAGGGGTGTGACACACATTTTCGGACAATTTGAA Entire TAAAGGGCACGGTGCGTGCGCGCGGTGACTATTCCAGCTTCTGGCTTCCAG Utrophin CACGCACGACTGGTTCCGGGATTCTCGCACCGCGCACCGCACGGAGCCGG promoter “A” GCTGCTGCGGGCTGGGAGGGCGCCTAGGGCTAGCGCTGATTGACCGGGC mouse DNA GCGCGGTCAGGTGACCCGAAGCGCCACGTTCTGGGAGCCCGGCCCGCGGT sequence GGCTTCCCAGGCCGGGGCAGGACCGAACCCGGAGCCGAGGGGGACTGGT CTCCCCGGGAGAACAAAGTTGCCCGGCCTGGGGCCCCGGGGCGCGCGAG CGGCGAG 34 MEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQ VP16-Jazz KLISEEDLNEMESLGDLTMGPKKKRKVAPPTDVSLGDELHLDGEDVAMAHADAL protein with DDFDLDMLGDGDSPGPGFTPHDSAPYGALDMADFEFEQMFTDALGIDEYGGEF myc tag, PGILDDRPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVL NLS, VP16, RRHNRTHTGEKPFACDICGRKFASRDVLRRHNRIHLRQNDLDRTYKIR zinc fingers 35 MEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQ UtroUp KLISEEDLNEMESLGDLTMGPKKKRKVAPPTDVSLGDELHLDGEDVAMAHADAL protein (myc DDFDLDMLGDGDSPGPGFTPHDSAPYGALDMADFEFEQMFTDALGIDEYGGEF tag, NLS, PGILAKRFADFTVYRNRTLQKWHDKTKLASGKLGKGFGAFERSILTQIDHILMDK VP16, CJ7, ERLLRRTQTKRSVYRVLGKPEPAAQPVPESLPGEPEILPQAPANAHLKDLDEEIF zinc fingers) DDDDFYHQLLRELIERKTSSLGILDRPYACPVESCDRRFSRSDNLVRHIRIHTGQ KPFQCRICMRNFSRSDHLTTHNRTHTGEKPFACDICGRKFADPGHLVRHNRIHT GEKPFACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVLRRHN RTHTGEKPFACDICGRKFASRDVLRRHNRIHLRQNDLERSTGGIPVTPPQCLSW PWKLPLQCPPALS 36 MEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQ VP16-Bagly KLISEEDLNEMESLGDLTMGPKKKRKVAPPTDVSLGDELHLDGEDVAMAHADAL protein DDFDLDMLGDGDSPGPGFTPHDSAPYGALDMADFEFEQMFTDALGIDEYGGEF (myc, NLS, PGILDDRPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVR VP16, 4 zinc RHNRTHTGEKPFACDICGRKFASRDVLRRHNRIHLRQGPRSHVCAECGKAFVE fingers) SSKLKRHQLVHELERSPFSSRDLRVASL 37 APPTDVSLGDELHLDGEDVAMAHADALDDFDLDMLGDGDSPGPGFTPHDSAP Vp16 YGALDMADFEFEQMFTDALGIDEYGG protein seq 38 MAAAKAEMQLMSPLQISDPFGSFPHSPTMDNYPKLEEMMLLSNGAPQFLGAAG JZif1 protein APEGSGSNSSSSSSGGGGGGGGGSNSSSSSSTFNPQADTGEQPYEHLTAESF sequence PDISLNNEKVLVETSYPSQTTRLPPITYTGRFSLEPAPNSGNTLWPEPLFSLVSG (artificial LVSMTNPPASSSSAPSPAASSASASQSPPLSCAVPSNDSSPIYSAAPTFPTPNT protein) DIFPEPQSQAFPGSAGTALQYPPPAYPAAKGGFQVPMIPDYLFPQQQGDLGLG TPDQKPFQGLESRTQQPSLTPLSTIKAFATQSGSQDLKALNTSYQSQLIKPSRM RKYPNRPSKTPPHERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMR NFSSRDVLRRHNRTHTGEKPFACDICGRKFASRDVLRRHNRIHLRQKDKKADK SVVASSATSSLSSYPSPVATSYPSPVTTSYPSPATTSYPSPVPTSFSSPGSSTYP SPVHSGFPSPSVATTYSSVPPAFPAQVSSFPSSAVTNSFSASTGLSDMTATFSP RTIEIC 39 MAAAKAEMQLMSPLQISDPFGSFPHSPTMDNYPKLEEMMLLSNGAPQFLGAAG JZif2 protein APEGSGSNSSSSSSGGGGGGGGGSNSSSSSSTFNPQADTGEQPYEHLTAESF sequence PDISLNNEKVLVETSYPSQTTRLPPITYTGRFSLEPAPNSGNTLWPEPLFSLVSG (artificial LVSMTNPPASSSSAPSPAASSASASQSPPLSCAVPSNDSSPIYSAAPTFPTPNT protein) DIFPEPQSQAFPGSAGTALQYPPPAYPAAKGGFQVPMIPDYLFPQQQGDLGLG TPDQKPFQGLESRTQQPSLTPLSTIKAFATQSGSQDLKALNTSYQSQLIKPSRM RKYPNRPSKTPPHERPYACPVESCDRRFSRSDNLVRHIRIHTGQKPFQCRICMR NFSRSDHLTTHIRTHTGEKPFACDICGRKFADPGHLVRHNRIHLRQKDKKADKS VVASSATSSLSSYPSPVATSYPSPVTTSYPSPATTSYPSPVPTSFSSPGSSTYPS PVHSGFPSPSVATTYSSVPPAFPAQVSSFPSSAVTNSFSASTGLSDMTATFSPR TIEIC 40 GGCTGGGAG JZif2 target sequence 41 MAAAKAEMQLMSPLQISDPFGSFPHSPTMDNYPKLEEMMLLSNGAPQFLGAAG Zif268 APEGSGSNSSSSSSGGGGGGGGGSNSSSSSSTFNPQADTGEQPYEHLTAESF protein PDISLNNEKVLVETSYPSQTTRLPPITYTGRFSLEPAPNSGNTLWPEPLFSLVSG human a.a. LVSMTNPPASSSSAPSPAASSASASQSPPLSCAVPSNDSSPIYSAAPTFPTPNT sequence DIFPEPQSQAFPGSAGTALQYPPPAYPAAKGGFQVPMIPDYLFPQQQGDLGLG TPDQKPFQGLESRTQQPSLTPLSTIKAFATQSGSQDLKALNTSYQSQLIKPSRM RKYPNRPSKTPPHERPYACPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMR NFSRSDHLTTHIRTHTGEKPFACDICGRKFARSDERKRHTKIHLRQKDKKADKS VVASSATSSLSSYPSPVATSYPSPVTTSYPSPATTSYPSPVPTSFSSPGSSTYPS PVHSGFPSPSVATTYSSVPPAFPAQVSSFPSSAVTNSFSASTGLSDMTATFSPR TIEIC 42 CPVESCDRRFSRSDELTRHIRIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGE Jzif1 zinc KPFACDICGRKFASRDVLRRHNRIH finger motifs a.a. seq 43 CPVESCDRRFSRSDNLVRHIRIHTGQKPFQCRICMRNFSRSDHLTTHIRTHTGE JZif2 zinc KPFACDICGRKFADPGHLVRHNRIH finger motifs a.a. seq 44 GCGTGGGCG Zif268 target sequence 45 CPVESCDRRFSRSDELTRHIRIH Zif268 ZF1 a.a. seq 46 CRICMRNFSRSDHLTTHIRTH Zif268 ZF2 a.a. seq 47 CDICGRKFARSDERKRHTKIH JZif268 ZF3 a.a. seq 48 CPVESCDRRFSRSDELTRHIRIH JZif1 ZF1 a.a. seq 49 CRICMRNFSSRDVLRRHNRTH JZif1 ZF2 a.a. seq 50 CDICGRKFASRDVLRRHNRIH JZif1 ZF3 a.a. seq 51 CPVESCDRRFSRSDNLVRHIRIH JZif2 ZF1 a.a. seq 52 CRICMRNFSRSDHLTTHIRTH JZif2 ZF2 a.a. seq 53 CDICGRKFADPGHLVRHNRIH JZif2 ZF3 a.a. seq 54 CPVESCDRRFSRSDELTRHIRIH Jazz, Bagly ZF1 a.a. seq 55 CRICMRNFSSRDVLRRHNRTH Jazz, Bagly, ZF2 a.a. seq; UtroUP ZF5 56 CDICGRKFASRDVLRRHNRIH Jazz, Bagly ZF3 a.a. seq; UtroUP ZF6 57 CAECGKAFVESSKLKRHQLVH Bagly ZF4 a.a. seq 58 CPVESCDRRFSRSDNLVRHIRIH UtroUp ZF1 a.a. seq 59 CRICMRNFSRSDHLTTHNRTH UtroUp ZF2 a.a. seq 60 CDICGRKFADPGHLVRHNRIH UtroUp ZF3 a.a. seq 61 CPVESCDRRFSRSDELTRHIRIH UtroUp ZF4 a.a. seq 62 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV-Bagly CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC (complete GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA sequence) CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAG GACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAA TGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACCTCACCATG GGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCAGCCTGGG GGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCATGCCGACG CGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCCCCGGGTC CGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGATATGGCCG ACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACGAGTACGG TGGGGAATTCCCGGGGATCCTCGATGACAGACCCTATGCTTGCCCAGTGGA AAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCTTACCCGCCATATCCG CATCCACACCGGCCAAAAACCCTTTCAATGCCGTATCTGCATGAGGAATTTC AGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACCCACACAGGGGAAAA GCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCAAGCCGCGATGTCCT GAGGCGCCATAACAGGATACATTTGAGGCAAGGTCCCAGATCTCACGTCTG TGCAGAATGTGGCAAAGCGTTCGTTGAGAGCTCAAAGCTAAAACGACACCA GCTGGTTCATGAGCTGGAGAGAAGCCCTTTTAGCTCGAGAGATCTACGGGT GGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCA CTCCAGTGCCCACCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCT GACTAGGTGTCCTTCTATAATATTATGGGGTGGAGGGGGGTGGTATGGAGC AAGGGGCAAGTTGGGAAGACAACCTGTAGGGCCTGCGGGGTCTATTGGGA ACCAAGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAATCTCCGCCTC CTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTTGTTGGGATTCCA GGCATGCATGACCAGGCTCAGCTAATTTTTGTTTTTTTGGTAGAGACGGGGT TTCACCATATTGGCCAGGCTGGTCTCCAACTCCTAATCTCAGGTGATCTACC CACCTTGGCCTCCCAAATTGCTGGGATTACAGGCGTGAACCACTGCTCCCTT CCCTGTCCTTCTGATTTTGTAGGTAACCACGTGCGGACCGAGCGGCCGCAG GAACCCCTAGTGATGGAGTTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTC ACTGAGGCCGGGCGACCAAAGGTCGCCCGACGCCCGGGCTTTGCCCGGGC GGCCTCAGTGAGCGAGCGAGCGCGCAGCTGCCTGCAGGGGCGCCTGATGC GGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATACGTCAAAG CAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGT GGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTC CTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCA AGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCAC CTCGACCCCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGC CCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGT GGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGGCTATTCTTT TGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGA TTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTTAT GGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCC GACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCG GCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAG AGGTTTTCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATA CGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGG TGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAA TACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAA TAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTAT TCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGG TGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGA ACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGT TTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCG TATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAAT GACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGA CAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGC CAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTG CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTG AATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATG GCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCC GGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCT GCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGG TGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCC CTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAA CGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAAC TGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTT AATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATC CCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCA AAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACA AAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAA CTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGT CCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCG CCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCG ATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGC GCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGC GAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCG CCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTA TCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGT GATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCC TTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT 63 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV- CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC UtroUp GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA (complete CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG sequence) TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAG GACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAA TGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCT CATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACCTCACCATG GGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCAGCCTGGG GGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCATGCCGACG CGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCCCCGGGTC CGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGATATGGCCG ACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACGAGTACGG TGGGGAATTCCCGGGGATCTTGGCAAAGCGCTTTGCCGACTTTACAGTCTA CAGGAACCGCACACTTCAGAAATGGCACGATAAGACCAAACTGGCTTCTGG AAAACTGGGGAAGGGTTTTGGTGCCTTTGAACGCTCAATCTTGACTCAGATC GACCATATTCTGATGGACAAAGAGAGATTACTTCGAAGGACACAGACCAAGC GCTCTGTCTATCGAGTTCTTGGCAAACCTGAGCCAGCAGCTCAGCCTGTCC CAGAGAGTTTGCCAGGGGAACCGGAGATCCTTCCTCAAGCCCCTGCTAATG CTCATCTGAAGGACTTGGATGAAGAAATCTTTGATGATGATGACTTTTACCAC CAGCTCCTTCGAGAACTCATAGAACGGAAGACCAGCTCCTTGGGGATCCTG GATCGCCCTTACGCCTGCCCTGTGGAATCTTGCGACCGCCGGTTCTCCCGC AGCGATAACCTGGTGCGGCACATCCGGATTCACACCGGCCAGAAACCTTTC CAGTGCAGGATCTGCATGAGAAATTTCTCCCGGTCCGACCACCTGACCACC CACAATAGGACCCACACCGGCGAGAAACCCTTTGCCTGCGACATCTGCGGG AGAAAGTTCGCCGACCCCGGCCACCTGGTGAGACACAATAGAATCCACACC GGTGAAAAGCCCTTCGCCTGTCCCGTGGAGAGCTGCGATCGCAGATTCAGC CGCAGCGACGAGCTGACAAGGCACATCAGAATCCACACCGGGCAGAAGCC. TTTTCAGTGCCGGATCTGCATGAGGAACTTCAGCTCCCGGGACGTGCTGAG ACGCCACAATCGCACACACACCGGCGAAAAGCCCTTCGCCTGTGATATTTG CGGGCGGAAATTTGCCTCCAGAGATGTGCTGCGCCGCCACAACCGCATTCA CCTGAGACAGAACGATCTCGAGAGATCTACGGGTGGCATCCCTGTGACCCC TCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGC CTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTAT AATATTATGGGGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAA GACAACCTGTAGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGT GGCACAATCTTGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTC CTGCCTCAGCCTCCCGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTC AGCTAATTTTTGTTTTTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCT GGTCTCCAACTCCTAATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTG CTGGGATTACAGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGT AGGTAACCACGTGCGGACCGAGCGGCCGCAGGAACCCCTAGTGATGGAGT TGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCA AAGGTCGCCCGACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCG AGCGCGCAGCTGCCTGCAGGGGCGCCTGATGCGGTATTTTCTCCTTACGCA TCTGTGCGGTATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGCCCT GTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACC GCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCC CTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGA TTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGC CCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTG GAACAACACTCAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTG CCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGC GAATTTTAACAAAATATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAAT CTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGC TGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGC TGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACC GAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAAT GTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATG TGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCG CTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAG TATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTT GCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGA AGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGG TAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACT TTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAG AGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTC ACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGC AGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAA CGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATC ATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAA CGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAA ACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACT GGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGG CTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCG GTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTAT CTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGC TGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTAC TCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAG GTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTC GTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGAT CCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACC AGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTA ACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGT AGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCT GCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACC GGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTG AACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCG AACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAG GGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAG CGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTC GGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG GGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTG GCCTTTTGCTGGCCTTTTGCTCACATGT 64 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV-JZif1 CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC (complete GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA sequence) CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATGGTACCGAATTCAATGGCCGCT GCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAGCGACCCCTTC GGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAAGCTGGAAGAG ATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGCCGCTGGCGCC CCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCGGCGGAGGCGG AGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAGCACATTCAATC CACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGACCGCCGAGAGC TTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGAAACCAGCTAC CCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGGCCGGTTCAGC CTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCGAGCCCCTGTTT AGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTGCCAGCAGCTCC TCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCCAGAGCCCTCCA CTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCTACAGCGCCGC TCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGCCTCAGAGCCA GGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCACCTCCTGCCTA TCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCGACTACCTGTT CCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGACCAGAAGCCTT TCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGACCCCCCTGAGC ACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACCTGAAGGCCCT GAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGATGCGGAAGTA CCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCTTACGCCTGCC CCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACGAGCTGACCCGG CACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCGGATCTGCATG CGGAACTTCAGCAGCCGGGACGTGCTGCGGCGGCACAATAGAACCCACAC AGGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAGTTCGCCAGCA GAGATGTGCTGCGGAGACACAACAGGATCCACCTGAGACAGAAGGACAAGA AAGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAGCCTGAGCAGC TACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGACCACAAGCTACC CATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTACCAGCTTCAGCT CTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGGCTTCCCTAGCC CTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTTCCCAGCTCAAG TGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAGCGCCAGCACCG GCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCATCGAGATCTGCT GACTCGAGAGATCTACGGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCT CCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGTCCTAATAAAA TTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTATAATATTATGGGGTG GAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAAGACAACCTGTAGGG CCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGTGGCACAATCTTGGC TCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCC CGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTCAGCTAATTTTTGTTT TTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCTGGTCTCCAACTCCT AATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTGCTGGGATTACAGGC GTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGTAGGTAACCACGTGC GGACCGAGCGGCCGCAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCT CTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACG CCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGC CTGCAGGGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTT CACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATT AAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCA GCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTT CGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCG ATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGATGGT TCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGG AGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAAC CCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTA TTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAAT ATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCG CATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGAC GGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCG GGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGAC GAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATG GTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAA CCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACA TTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTG CTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTG CACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGA GTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTA TGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGC CGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAA AGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAAC CATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACC GAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTT GATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGAC ACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCG AACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGA TAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATT GCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCA CTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGG AGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCC TCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTA GATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTT TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGT CAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCG CGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGT TTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCA GAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCA CTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTA CCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCA AGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTC GTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCT ACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGG ACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAG CTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACC TCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTAT GGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGC CTTTTGCTCACATGT 65 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV-JZif2 CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC complete GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA sequence CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTCCGCGGATTC GAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGA GTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAATGCTTTCTT CTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTAATCTCTTTCT TTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCATTCTAAAGAA TAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTGCATATAAATA TTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGCTAATAGCAG CTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGAT TATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCTCTTATCT TCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACT TTGGCAAAGAATTGGGATTCGAACATCGATGGTACCGAATTCAATGGCCGCT GCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAGCGACCCCTTC GGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAAGCTGGAAGAG ATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGCCGCTGGCGCC CCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCGGCGGAGGCGG AGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAGCACATTCAATC CACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGACCGCCGAGAGC TTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGAAACCAGCTAC CCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGGCCGGTTCAGC CTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCGAGCCCCTGTTT AGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTGCCAGCAGCTCC TCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCCAGAGCCCTCCA CTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCTACAGCGCCGC TCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGCCTCAGAGCCA GGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCACCTCCTGCCTA TCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCGACTACCTGTT CCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGACCAGAAGCCTT TCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGACCCCCCTGAGC ACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACCTGAAGGCCCT GAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGATGCGGAAGTA CCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCTTACGCCTGCC CCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACAACCTGGTCCGG CACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCGGATCTGCATG CGGAACTTCTCTCGGAGCGACCACCTGACCACCCACATCAGAACCCACACA GGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAGTTCGCCGACCC CGGCCACCTCGTCAGACACAACAGGATTCACCTGAGACAGAAGGACAAGAA AGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAGCCTGAGCAGCT ACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGACCACAAGCTACC CATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTACCAGCTTCAGCT CTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGGCTTCCCTAGCC CTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTTCCCAGCTCAAG TGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAGCGCCAGCACCG GCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCATCGAGATCTGCT GACTCGAGAGATCTACGGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCT CCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGTCCTAATAAAA TTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTATAATATTATGGGGTG GAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAAGACAACCTGTAGGG CCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGTGGCACAATCTTGGC TCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCC CGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTCAGCTAATTTTTGTTT TTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCTGGTCTCCAACTCCT AATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTGCTGGGATTACAGGC GTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGTAGGTAACCACGTGC GGACCGAGCGGCCGCAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTCT CTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCCGACG CCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGCTGC CTGCAGGGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTT CACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATT AAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCA GCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTT CGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCG ATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGATGGT TCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGG AGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAAC CCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTA TTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAAT ATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCG CATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGAC GGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCG GGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGAC GAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATG GTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAA CCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACA TTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTG CTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTG CACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGA GTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTA TGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGC CGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAA AGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAAC CATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACC GAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTT GATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGAC ACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCG AACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGA TAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATT GCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCA CTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGG AGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCC TCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTA GATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTT TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGT CAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCG CGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGT TTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCA GAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCA CTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTA CCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCA AGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTC GTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCT ACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGG ACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAG CTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACC TCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTAT GGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGC CTTTTGCTCACATGT 66 ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGC DNA TCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGA sequence GGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAA Vp16-Jazz ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGG (includes GCGACCTCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCG myc tag, ATGTCAGCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATG NLS, Vp16, GCGCATGCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGG Jazz) GGATTCCCCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCG CTCTGGATATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGG AATTGACGAGTACGGTGGGGAATTCCCGGGGATCCTCGATGACAGACCCTA TGCTTGCCCAGTGGAAAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCT TACCCGCCATATCCGCATCCACACCGGCCAAAAACCCTTTCAATGCCGTATC TGCATGAGGAATTTCAGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACC CACACAGGGGAAAAGCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCA AGCCGCGATGTCCTGAGGCGCCATAACAGGATACATTTGAGGCAAAATGAT CTCGACCGTACGTACAAGATCCGTTAG 67 ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGC DNA TCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGA sequence GGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAA Vp16-Bagly ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGG (includes GCGACCTCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCG myc tag, ATGTCAGCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATG NLS, Vp16, GCGCATGCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGG Bagly) GGATTCCCCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCG CTCTGGATATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGG AATTGACGAGTACGGTGGGGAATTCCCGGGGATCCTCGATGACAGACCCTA TGCTTGCCCAGTGGAAAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCT TACCCGCCATATCCGCATCCACACCGGCCAAAAACCCTTTCAATGCCGTATC TGCATGAGGAATTTCAGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACC CACACAGGGGAAAAGCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCA AGCCGCGATGTCCTGAGGCGCCATAACAGGATACATTTGAGGCAAGGTCCC AGATCTCACGTCTGTGCAGAATGTGGCAAAGCGTTCGTTGAGAGCTCAAAG CTAAAACGACACCAGCTGGTTCATGAGCTGGAGAGAAGCCCTTTTAGCTCGA GAGATCTACGGGTGGCATCCCTGTGA 68 ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGC DNA TCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGA sequence GGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAA VP16-CJ7- ATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGG UtroUp GCGACCTCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCG (includes ATGTCAGCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATG myc tag, GCGCATGCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGG NLS, GGATTCCCCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCG Vp16,CJ7 CTCTGGATATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGG UtroUp) AATTGACGAGTACGGTGGGGAATTCCCGGGGATCTTGGCAAAGCGCTTTGC CGACTTTACAGTCTACAGGAACCGCACACTTCAGAAATGGCACGATAAGACC AAACTGGCTTCTGGAAAACTGGGGAAGGGTTTTGGTGCCTTTGAACGCTCAA TCTTGACTCAGATCGACCATATTCTGATGGACAAAGAGAGATTACTTCGAAG GACACAGACCAAGCGCTCTGTCTATCGAGTTCTTGGCAAACCTGAGCCAGC AGCTCAGCCTGTCCCAGAGAGTTTGCCAGGGGAACCGGAGATCCTTCCTCA AGCCCCTGCTAATGCTCATCTGAAGGACTTGGATGAAGAAATCTTTGATGAT GATGACTTTTACCACCAGCTCCTTCGAGAACTCATAGAACGGAAGACCAGCT CCTTGGGGATCCTGGATCGCCCTTACGCCTGCCCTGTGGAATCTTGCGACC GCCGGTTCTCCCGCAGCGATAACCTGGTGCGGCACATCCGGATTCACACCG GCCAGAAACCTTTCCAGTGCAGGATCTGCATGAGAAATTTCTCCCGGTCCGA CCACCTGACCACCCACAATAGGACCCACACCGGCGAGAAACCCTTTGCCTG CGACATCTGCGGGAGAAAGTTCGCCGACCCCGGCCACCTGGTGAGACACA ATAGAATCCACACCGGTGAAAAGCCCTTCGCCTGTCCCGTGGAGAGCTGCG ATCGCAGATTCAGCCGCAGCGACGAGCTGACAAGGCACATCAGAATCCACA CCGGGCAGAAGCCTTTTCAGTGCCGGATCTGCATGAGGAACTTCAGCTCCC GGGACGTGCTGAGACGCCACAATCGCACACACACCGGCGAAAAGCCCTTC GCCTGTGATATTTGCGGGCGGAAATTTGCCTCCAGAGATGTGCTGCGCCGC CACAACCGCATTCACCTGAGACAGAACGATCTCGAGAGATCTACGGGTGGC ATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTC CAGTGCCCACCAGCCTTGTCCTAA 69 ATGGCCGCTGCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAG DNA CGACCCCTTCGGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAA sequence GCTGGAAGAGATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGC encoding for CGCTGGCGCCCCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCG full length GCGGAGGCGGAGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAG JZif1 CACATTCAATCCACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGAC CGCCGAGAGCTTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGA AACCAGCTACCCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGG CCGGTTCAGCCTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCG AGCCCCTGTTTAGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTG CCAGCAGCTCCTCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCC AGAGCCCTCCACTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCT ACAGCGCCGCTCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGC CTCAGAGCCAGGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCAC CTCCTGCCTATCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCG ACTACCTGTTCCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGAC CAGAAGCCTTTCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGAC CCCCCTGAGCACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACC TGAAGGCCCTGAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGA TGCGGAAGTACCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCT TACGCCTGCCCCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACGA GCTGACCCGGCACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCG GATCTGCATGCGGAACTTCAGCAGCCGGGACGTGCTGCGGCGGCACAATA GAACCCACACAGGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAG TTCGCCAGCAGAGATGTGCTGCGGAGACACAACAGGATCCACCTGAGACAG AAGGACAAGAAAGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAG CCTGAGCAGCTACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGAC CACAAGCTACCCATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTAC CAGCTTCAGCTCTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGG CTTCCCTAGCCCTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTT CCCAGCTCAAGTGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAG CGCCAGCACCGGCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCA TCGAGATCTGCTGA 70 ATGGCCGCTGCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAG DNA CGACCCCTTCGGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAA sequence GCTGGAAGAGATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGC encoding for CGCTGGCGCCCCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCG full length GCGGAGGCGGAGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAG JZif2 CACATTCAATCCACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGAC CGCCGAGAGCTTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGA AACCAGCTACCCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGG CCGGTTCAGCCTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCG AGCCCCTGTTTAGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTG CCAGCAGCTCCTCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCC AGAGCCCTCCACTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCT ACAGCGCCGCTCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGC CTCAGAGCCAGGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCAC CTCCTGCCTATCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCG ACTACCTGTTCCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGAC CAGAAGCCTTTCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGAC. CCCCCTGAGCACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACC TGAAGGCCCTGAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGA TGCGGAAGTACCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCT TACGCCTGCCCCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACAAC CTGGTCCGGCACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCGG ATCTGCATGCGGAACTTCTCTCGGAGCGACCACCTGACCACCCACATCAGA ACCCACACAGGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAGTTC GCCGACCCCGGCCACCTCGTCAGACACAACAGGATTCACCTGAGACAGAAG GACAAGAAAGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAGCCT GAGCAGCTACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGACCAC AAGCTACCCATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTACCAG CTTCAGCTCTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGGCTT CCCTAGCCCTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTTCCC AGCTCAAGTGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAGCGC CAGCACCGGCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCATCG AGATCTGCTGA 71 APPTDVSLGDELHLDGEDVAMAHADALDDFDLDMLGDGDSPGPGFTPHDSAP Minimal YGALDMADFEFEQMFTDALGIDEYGGEFPGILDDRPYACPVESCDRRFSRSDEL Jazz (Vp16- TRHIRIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGEKPFACDICGRKFASRD Jazz, no tag, VLRRHNRIH NLS) protein seq 72 APPTDVSLGDELHLDGEDVAMAHADALDDFDLDMLGDGDSPGPGFTPHDSAP Minimal YGALDMADFEFEQMFTDALGIDEYGGEFPGILDDRPYACPVESCDRRFSRSDEL Bagly TRHIRIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGEKPFACDICGRKFASRD (Vp16- VLRRHNRIHLRQGPRSHVCAECGKAFVESSKLKRHQLVH Bagly) protein seq 73 APPTDVSLGDELHLDGEDVAMAHADALDDFDLDMLGDGDSPGPGFTPHDSAP Minimal YGALDMADFEFEQMFTDALGIDEYGGEFPGILAKRFADFTVYRNRTLQKWHDKT UtroUp KLASGKLGKGFGAFERSILTQIDHILMDKERLLRRTQTKRSVYRVLGKPEPAAQP (Vp16-CJ7- VPESLPGEPEILPQAPANAHLKDLDEEIFDDDDFYHQLLRELIERKTSSLGILDRP UtroUp) YACPVESCDRRFSRSDNLVRHIRIHTGQKPFQCRICMRNFSRSDHLTTHNRTHT protein seq GEKPFACDICGRKFADPGHLVRHNRIHTGEKPFACPVESCDRRFSRSDELTRHI RIHTGQKPFQCRICMRNFSSRDVLRRHNRTHTGEKPFACDICGRKFASRDVLRR HNRIH 74 AKRFADFTVYRNRTLQKWHDKTKLASGKLGKGFGAFERSILTQIDHILMDKERLL CJ7 amino RRTQTKRSVYRVLGKPEPAAQPVPESLPGEPEILPQAPANAHLKDLDEEIFDDD acid DFYHQLLRELIERKTSSL sequence 75 MEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQKLISEEDLNEMEQ Multi myc KLISEEDLNEMESLGDLT tag 76 CGAGCCGAGAGTAGCA 77 CATGGTGAGGTCGCCCAAGCTCT 78 CGAGCCGAGAGTAGCA 79 GAGCTGGAGCTATTGCTTCC 80 CCTGCAGGCAGCTGCGCGCTCGCTCGCTCACTGAGGCCGCCCGGGCAAAG mAAV-GFP CCCGGGCGTCGGGCGACCTTTGGTCGCCCGGCCTCAGTGAGCGAGCGAGC sequence GCGCAGAGAGGGAGTGGCCAACTCCATCACTAGGGGTTCCTGCGGCCGCA (complete CGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACTTTACCATAAG sequence) TAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCATATATCAGTGA TATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTACAAGGTGGACT GGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATATAAAGATGAGT TTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAGAGTGCCGCGT CCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGACCAGCTAAGCC TCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCAGGTCTAGCCA GTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGGACACACATAG TGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAACCTGGGGCC AGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCTTTAGGAGAC CCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTGCTCAGGCTT TGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCTCCCCGGCGC TCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAGCTGCGCGCC CTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGGTCGACGTGG CTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAGAAATGGAGT TCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTCCTGAGACTC AGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCACACGACTCC CTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAAAGAACCCG AAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCCCGAGCGCC CAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGACAGGTGCGG TTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGCGGTGACCC TCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTCAGGAGGGG CAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCGTTACCTGGG ACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCCGGGCGGGG GCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCCCAACACCCA AATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGCGCGGAGGGA ATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCGGCCACCGCA GCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCGACCAGGGCC CGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGGCAGGAGTTG GGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTGAAGGACTCC GGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTGTTTAGTGAA CCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAG ACACCGGGACCGATCCAGCCTCCGCGGATTCGAATCCCGGCCGGGAACGG TGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTAT AGAGTCTATAGGCCCACAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTT ATCTTATTTCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACA ATGTATCATGCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGG TTAAGGCAATAGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAAC TGATGTAAGAGGTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCT GCTITTATTTTATGGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGG CCCTTTTGCTAATCATGTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGC AACGTGCTGGTCTGTGTGCTGGCCCATCACTTTGGCAAAGAATTGGGATTCG AACATCGATTTAAAGCTATGGTGAGTAAACAAATATTGAAGAACACTGGATTG CAGGAGATCATGTCGTTTAAAGTGAATCTGGAAGGTGTAGTAAACAATCATG TGTTCACAATGGAAGGTTGTGGAAAAGGAAATATTTTATTCGGAAACCAACT GGTTCAGATTCGTGTCACAAAAGGGGCTCCGCTTCCATTTGCATTTGATATT CTCTCACCAGCTTTCCAATACGGCAACCGTACATTCACGAAATACCCGGAGG ATATATCAGACTTTTTTATACAATCATTTCCAGCGGGATTTGTATACGAAAGA ACGTTGCGTTACGAAGATGGTGGACTGGTTGAAATCCGTTCAGATATAAATT TAATCGAGGAGATGTTTGTCTACAGAGTGGAATATAAAGGTAGTAACTTCCC GAATGATGGTCCAGTGATGAAGAAGACAATCACAGGATTACAACCTTCGTTC GAAGTTGTGTATATGAACGATGGCGTCTTGGTTGGCCAAGTCATTCTTGTTT ATAGATTAAACTCTGGCAAATTTTATTCGTGTCACATGAGAACACTGATGAAA TCAAAGGGTGTAGTGAAGGATTTTCCCGAATACCATTTCATTCAACATCGTTT AGAGAAGACGTATGTGGAAGACGGAGGTTTTGTTGAGCAACACGAGACGGC CATTGCTCAACTGACATCGCTGGGGAAACCACTTGGATCCTTACACGAATGG GTTTAACTCGAGAGATCTACGGGTGGCATCCCTGTGACCCCTCCCCAGTGC CTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGTCCTAAT AAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTATAATATTATGG GGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAAGACAACCTGT AGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGTGGCACAATCT TGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGC CTCCCGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTCAGCTAATTTTT GTTTTTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCTGGTCTCCAAC TCCTAATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTGCTGGGATTAC AGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGTAGGTAACCAC GTGCGGACCGAGCGGCCGCAGGAACCCCTAGTGATGGAGTTGGCCACTCC CTCTCTGCGCGCTCGCTCGCTCACTGAGGCCGGGCGACCAAAGGTCGCCC GACGCCCGGGCTTTGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAG CTGCCTGCAGGGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGG TATTTCACACCGCATACGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGC GCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTT GCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCA CGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGT TCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTTGGGTGA TGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACG TTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACT CAACCCTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGG CCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACA AAATATTAACGTTTACAATTTTATGGTGCACTCTCAGTACAATCTGCTCTGAT GCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCC CTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGT CTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGC GAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATA ATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAA CCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGAC AATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATT CAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGT TTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTG GGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTT GAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCT GCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGG TCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACA GAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCA TAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGG ACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGC CTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGT GACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTG GCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGC GGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTT TATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGC AGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGAC GGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGG TGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATAC TTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCC TTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGA GCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCT GCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGT TTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTC AGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCC ACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCT GTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGA CTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGG GTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGAT ACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGG CGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGG GAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGC CACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGC CTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCT GGCCTTTTGCTCACATGT 81 GAATTCAATGGCCGCGGCCAAGGCCGAGATGCAGCTGATGTCCCCGCTGCA Synthesized GATCTCTGACCCGTTCGGATCCTTTCCTCACTCGCCCACCATGGACAACTAC JZif1 CCTAAGCTGGAGGAGATGATGCTGCTGAGCAACGGGGCTCCCCAGTTCCTC construct to GGCGCCGCCGGGGCCCCAGAGGGCAGCGGCAGCAACAGCAGCAGCAGCA insert into GCAGCGGGGGCGGTGGAGGCGGCGGGGGCGGCAGCAACAGCAGCAGCA mAAV GCAGCAGCACCTTCAACCCTCAGGCGGACACGGGCGAGCAGCCCTACGAG includes CACCTGACCGCAGAGTCTTTTCCTGACATCTCTCTGAACAACGAGAAGGTGC cloning TGGTGGAGACCAGTTACCCCAGCCAAACCACTCGACTGCCCCCCATCACCT sequences ATACTGGCCGCTTTTCCCTGGAGCCTGCACCCAACAGTGGCAACACCTTGT GGCCCGAGCCCCTCTTCAGCTTGGTCAGTGGCCTAGTGAGCATGACCAACC CACCGGCCTCCTCGTCCTCAGCACCATCTCCAGCGGCCTCCTCCGCCTCCG CCTCCCAGAGCCCACCCCTGAGCTGCGCAGTGCCATCCAACGACAGCAGTC CCATTTACTCAGCGGCACCCACCTTCCCCACGCCGAACACTGACATTTTCCC TGAGCCACAAAGCCAGGCCTTCCCGGGCTCGGCAGGGACAGCGCTCCAGT ACCCGCCTCCTGCCTACCCTGCCGCCAAGGGTGGCTTCCAGGTTCCCATGA TCCCCGACTACCTGTTTCCACAGCAGCAGGGGGATCTGGGCCTGGGCACCC CAGACCAGAAGCCCTTCCAGGGCCTGGAGAGCCGCACCCAGCAGCCTTCG CTAACCCCTCTGTCTACTATTAAGGCCTTTGCCACTCAGTCGGGCTCCCAGG ACCTGAAGGCCCTCAATACCAGCTACCAGTCCCAGCTCATCAAACCCAGCC GCATGCGCAAGTACCCCAACCGGCCCAGCAAGACGCCCCCCCACGAACGC CCTTACGCTTGCCCAGTGGAGTCCTGTGATCGCCGCTTCTCCCGCTCCGAC GAGCTCACCCGCCACATCCGCATCCACACAGGCCAGAAGCCCTTCCAGTGC CGCATCTGCATGCGCAACTTCAGCAGCCGCGACGTCCTCAGGCGCCACAAC CGCACCCACACAGGCGAAAAGCCCTTCGCCTGCGACATCTGTGGAAGAAAG TTTGCCAGCAGGGATGTCCTGAGGAGGCATAACAGGATCCACTTGCGGCAG AAGGACAAGAAAGCAGACAAAAGTGTTGTGGCCTCTTCGGCCACCTCCTCT CTCTCTTCCTACCCGTCCCCGGTTGCTACCTCTTACCCGTCCCCGGTTACTA CCTCTTATCCATCCCCGGCCACCACCTCATACCCATCCCCTGTGCCCACCTC CTTCTCCTCTCCCGGCTCCTCGACCTACCCATCCCCTGTGCACAGTGGCTTC CCCTCCCCGTCGGTGGCCACCACGTACTCCTCTGTTCCCCCTGCTTTCCCG GCCCAGGTCAGCAGCTTCCCTTCCTCAGCTGTCACCAACTCCTTCAGCGCC TCCACAGGGCTTTCGGACATGACAGCAACCTTTTCTCCCAGGACAATTGAAA TTTGCTAACTCGAG 82 GAATTCAATGGCCGCGGCCAAGGCCGAGATGCAGCTGATGTCCCCGCTGCA Synthesized GATCTCTGACCCGTTCGGATCCTTTCCTCACTCGCCCACCATGGACAACTAC JZif2 CCTAAGCTGGAGGAGATGATGCTGCTGAGCAACGGGGCTCCCCAGTTCCTC construct to GGCGCCGCCGGGGCCCCAGAGGGCAGCGGCAGCAACAGCAGCAGCAGCA insert into GCAGCGGGGGCGGTGGAGGCGGCGGGGGCGGCAGCAACAGCAGCAGCA mAAV GCAGCAGCACCTTCAACCCTCAGGCGGACACGGGCGAGCAGCCCTACGAG includes CACCTGACCGCAGAGTCTTTTCCTGACATCTCTCTGAACAACGAGAAGGTGC cloning TGGTGGAGACCAGTTACCCCAGCCAAACCACTCGACTGCCCCCCATCACCT sequences ATACTGGCCGCTTTTCCCTGGAGCCTGCACCCAACAGTGGCAACACCTTGT GGCCCGAGCCCCTCTTCAGCTTGGTCAGTGGCCTAGTGAGCATGACCAACC CACCGGCCTCCTCGTCCTCAGCACCATCTCCAGCGGCCTCCTCCGCCTCCG CCTCCCAGAGCCCACCCCTGAGCTGCGCAGTGCCATCCAACGACAGCAGTC CCATTTACTCAGCGGCACCCACCTTCCCCACGCCGAACACTGACATTTTCCC TGAGCCACAAAGCCAGGCCTTCCCGGGCTCGGCAGGGACAGCGCTCCAGT ACCCGCCTCCTGCCTACCCTGCCGCCAAGGGTGGCTTCCAGGTTCCCATGA TCCCCGACTACCTGTTTCCACAGCAGCAGGGGGATCTGGGCCTGGGCACCC CAGACCAGAAGCCCTTCCAGGGCCTGGAGAGCCGCACCCAGCAGCCTTCG CTAACCCCTCTGTCTACTATTAAGGCCTTTGCCACTCAGTCGGGCTCCCAGG ACCTGAAGGCCCTCAATACCAGCTACCAGTCCCAGCTCATCAAACCCAGCC GCATGCGCAAGTACCCCAACCGGCCCAGCAAGACGCCCCCCCACGAACGC CCTTACGCTTGCCCAGTGGAGTCCTGTGATCGCCGCTTCTCCCGCTCCGAC GAGCTCACCCGCCACATCCGCATCCACACAGGCCAGAAGCCCTTCCAGTGC CGCATCTGCATGCGCAACTTCAGCAGCCGCGACGTCCTCAGGCGCCACAAC CGCACCCACACAGGCGAAAAGCCCTTCGCCTGCGACATCTGTGGAAGAAAG TTTGCCAGCAGGGATGTCCTGAGGAGGCATAACAGGATCCACTTGCGGCAG AAGGACAAGAAAGCAGACAAAAGTGTTGTGGCCTCTTCGGCCACCTCCTCT CTCTCTTCCTACCCGTCCCCGGTTGCTACCTCTTACCCGTCCCCGGTTACTA CCTCTTATCCATCCCCGGCCACCACCTCATACCCATCCCCTGTGCCCACCTC CTTCTCCTCTCCCGGCTCCTCGACCTACCCATCCCCTGTGCACAGTGGCTTC CCCTCCCCGTCGGTGGCCACCACGTACTCCTCTGTTCCCCCTGCTTTCCCG GCCCAGGTCAGCAGCTTCCCTTCCTCAGCTGTCACCAACTCCTTCAGCGCC TCCACAGGGCTTTCGGACATGACAGCAACCTTTTCTCCCAGGACAATTGAAA TTTGCTAACTCGAG 83 GCGGCCGCACGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACT Sequence of TTACCATAAGTAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCAT mAAV- ATATCAGTGATATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTAC Vp16-Jazz AAGGTGGACTGGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATAT that is AAAGATGAGTTTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAG incorporated AGTGCCGCGTCCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGAC into viral CAGCTAAGCCTCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCA particle and GGTCTAGCCAGTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGG delivered to ACACACATAGTGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAA humans or CCTGGGGCCAGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCT mice TTAGGAGACCCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTG CTCAGGCTTTGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCT CCCCGGCGCTCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAG CTGCGCGCCCTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGG TCGACGTGGCTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAG AAATGGAGTTCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTC CTGAGACTCAGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCAC ACGACTCCCTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAA AGAACCCGAAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCC CGAGCGCCCAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGAC AGGTGCGGTTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGC GGTGACCCTCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTC AGGAGGGGCAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCG TTACCTGGGACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCC GGGCGGGGGCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCC CAACACCCAAATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGC GCGGAGGGAATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCG GCCACCGCAGCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCG ACCAGGGCCCGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGG CAGGAGTTGGGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTG AAGGACTCCGGGGCAGAACCCAGTCGGITCACCTGGTAAGCTTGCTAGCTC CGCGGATTCGAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCC GTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAA TGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTA ATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCA TTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTG CATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGC TAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATAC CTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGC CCATCACTTTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTT CTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTT GAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACC TCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCA GCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCAT GCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCC CCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGAT ATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACG AGTACGGTGGGGAATTCCCGGGGATCCTCGATGACAGACCCTATGCTTGCC CAGTGGAAAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCTTACCCGCC ATATCCGCATCCACACCGGCCAAAAACCCTTTCAATGCCGTATCTGCATGAG GAATTTCAGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACCCACACAGG GGAAAAGCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCAAGCCGCGA TGTCCTGAGGCGCCATAACAGGATACATTTGAGGCAAAATGATCTCGACCGT ACGTACAAGATCCGTTAGCATATGCTAACAGATCCACGGGTGGCATCCCTGT GACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCC ACCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGTACGG GTGGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGC CACTCCAGTGCCCACCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGT CTGACTAGGTGTCCTTCTATAATATTATGGGGTGGAGGGGGGTGGTATGGA GCAAGGGGCAAGTTGGGAAGACAACCTGTAGGGCCTGCGGGGTCTATTGG GAACCAAGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAATCTCCGCC TCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTTGTTGGGATTCC AGGCATGCATGACCAGGCTCAGCTAATTTTTGTTTTTTTGGTAGAGACGGGG TTTCACCATATTGGCCAGGCTGGTCTCCAACTCCTAATCTCAGGTGATCTAC CCACCTTGGCCTCCCAAATTGCTGGGATTACAGGCGTGAACCACTGCTCCC TTCCCTGTCCTTCTGATTTTGTAGGTAACCACGTGCGGACCGAGCGGCCGC 84 GCGGCCGCACGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACT Sequence of TTACCATAAGTAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCAT mAAV- ATATCAGTGATATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTAC Vp16-Bagly AAGGTGGACTGGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATAT that is AAAGATGAGTTTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAG incorporated AGTGCCGCGTCCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGAC into viral CAGCTAAGCCTCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCA particle and GGTCTAGCCAGTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGG delivered to ACACACATAGTGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAA humans or CCTGGGGCCAGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCT mice TTAGGAGACCCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTG CTCAGGCTTTGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCT CCCCGGCGCTCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAG CTGCGCGCCCTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGG TCGACGTGGCTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAG AAATGGAGTTCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTC CTGAGACTCAGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCAC ACGACTCCCTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAA AGAACCCGAAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCC CGAGCGCCCAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGAC AGGTGCGGTTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGC GGTGACCCTCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTC AGGAGGGGCAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCG TTACCTGGGACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCC GGGCGGGGGCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCC CAACACCCAAATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGC GCGGAGGGAATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCG GCCACCGCAGCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCG ACCAGGGCCCGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGG CAGGAGTTGGGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTG AAGGACTCCGGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTC CGCGGATTCGAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCC GTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAA TGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTA ATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCA TTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTG CATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGC TAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATAC CTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGC CCATCACTTTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTT CTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTT GAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACC TCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCA GCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCAT GCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCC CCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGAT ATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACG AGTACGGTGGGGAATTCCCGGGGATCCTCGATGACAGACCCTATGCTTGCC CAGTGGAAAGCTGCGACCGCCGCTTTTCTAGATCGGATGAGCTTACCCGCC ATATCCGCATCCACACCGGCCAAAAACCCTTTCAATGCCGTATCTGCATGAG GAATTTCAGCAGCCGCGATGTCCTGAGGCGCCATAACAGGACCCACACAGG GGAAAAGCCATTCGCATGTGACATCTGCGGTCGAAAGTTTGCAAGCCGCGA TGTCCTGAGGCGCCATAACAGGATACATTTGAGGCAAGGTCCCAGATCTCA CGTCTGTGCAGAATGTGGCAAAGCGTTCGTTGAGAGCTCAAAGCTAAAACG ACACCAGCTGGTTCATGAGCTGGAGAGAAGCCCTTTTAGCTCGAGAGATCTA CGGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGT TGCCACTCCAGTGCCCACCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTT TGTCTGACTAGGTGTCCTTCTATAATATTATGGGGTGGAGGGGGGTGGTATG GAGCAAGGGGCAAGTTGGGAAGACAACCTGTAGGGCCTGCGGGGTCTATT GGGAACCAAGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAATCTCCG CCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTTGTTGGGATT CCAGGCATGCATGACCAGGCTCAGCTAATTTTTGTTTTTTTGGTAGAGACGG GGTTTCACCATATTGGCCAGGCTGGTCTCCAACTCCTAATCTCAGGTGATCT ACCCACCTTGGCCTCCCAAATTGCTGGGATTACAGGCGTGAACCACTGCTC CCTTCCCTGTCCTTCTGATTTTGTAGGTAACCACGTGCGGACCGAGCGGCC GC 85 GCGGCCGCACGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACT Sequence of TTACCATAAGTAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCAT mAAV- ATATCAGTGATATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTAC Vp16-CJ7- AAGGTGGACTGGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATAT UtroUp that AAAGATGAGTTTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAG is AGTGCCGCGTCCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGAC incorporated CAGCTAAGCCTCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCA into viral GGTCTAGCCAGTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGG particle and ACACACATAGTGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAA delivered to CCTGGGGCCAGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCT humans or TTAGGAGACCCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTG mice CTCAGGCTTTGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCT CCCCGGCGCTCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAG CTGCGCGCCCTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGG TCGACGTGGCTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAG AAATGGAGTTCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTC CTGAGACTCAGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCAC ACGACTCCCTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAA AGAACCCGAAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCC CGAGCGCCCAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGAC AGGTGCGGTTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGC GGTGACCCTCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTC AGGAGGGGCAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCG TTACCTGGGACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCC GGGCGGGGGCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCC CAACACCCAAATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGC GCGGAGGGAATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCG GCCACCGCAGCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCG ACCAGGGCCCGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGG CAGGAGTTGGGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTG AAGGACTCCGGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTC CGCGGATTCGAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCC GTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAA TGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTA ATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCA TTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTG CATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGC TAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATAC CTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGC CCATCACTTTGGCAAAGAATTGGGATTCGAACATCGATTTAAAGCTATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTT CTGAAGAGGACTTGAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTT GAATGAAATGGAGCAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAG CAAAAGCTCATTTCTGAAGAGGACTTGAATGAAATGGAGAGCTTGGGCGACC TCACCATGGGCCCTAAAAAGAAGCGTAAAGTCGCCCCCCCGACCGATGTCA GCCTGGGGGACGAGCTCCACTTAGACGGCGAGGACGTGGCGATGGCGCAT GCCGACGCGCTAGACGATTTCGATCTGGACATGTTGGGGGACGGGGATTCC CCGGGTCCGGGATTTACCCCCCACGACTCCGCCCCCTACGGCGCTCTGGAT ATGGCCGACTTCGAGTTTGAGCAGATGTTTACCGATGCCCTTGGAATTGACG AGTACGGTGGGGAATTCCCGGGGATCTTGGCAAAGCGCTTTGCCGACTTTA CAGTCTACAGGAACCGCACACTTCAGAAATGGCACGATAAGACCAAACTGG CTTCTGGAAAACTGGGGAAGGGTTTTGGTGCCTTTGAACGCTCAATCTTGAC TCAGATCGACCATATTCTGATGGACAAAGAGAGATTACTTCGAAGGACACAG ACCAAGCGCTCTGTCTATCGAGTTCTTGGCAAACCTGAGCCAGCAGCTCAG CCTGTCCCAGAGAGTTTGCCAGGGGAACCGGAGATCCTTCCTCAAGCCCCT GCTAATGCTCATCTGAAGGACTTGGATGAAGAAATCTTTGATGATGATGACTT TTACCACCAGCTCCTTCGAGAACTCATAGAACGGAAGACCAGCTCCTTGGG GATCCTGGATCGCCCTTACGCCTGCCCTGTGGAATCTTGCGACCGCCGGTT CTCCCGCAGCGATAACCTGGTGCGGCACATCCGGATTCACACCGGCCAGAA ACCTTTCCAGTGCAGGATCTGCATGAGAAATTTCTCCCGGTCCGACCACCTG ACCACCCACAATAGGACCCACACCGGCGAGAAACCCTTTGCCTGCGACATC TGCGGGAGAAAGTTCGCCGACCCCGGCCACCTGGTGAGACACAATAGAATC CACACCGGTGAAAAGCCCTTCGCCTGTCCCGTGGAGAGCTGCGATCGCAGA TTCAGCCGCAGCGACGAGCTGACAAGGCACATCAGAATCCACACCGGGCAG AAGCCTTTTCAGTGCCGGATCTGCATGAGGAACTTCAGCTCCCGGGACGTG CTGAGACGCCACAATCGCACACACACCGGCGAAAAGCCCTTCGCCTGTGAT ATTTGCGGGCGGAAATTTGCCTCCAGAGATGTGCTGCGCCGCCACAACCGC ATTCACCTGAGACAGAACGATCTCGAGAGATCTACGGGTGGCATCCCTGTG ACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCA CCAGCCTTGTCCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCC TTCTATAATATTATGGGGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTT GGGAAGACAACCTGTAGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAG TGCAGTGGCACAATCTTGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGC GATTCTCCTGCCTCAGCCTCCCGAGTTGTTGGGATTCCAGGCATGCATGAC CAGGCTCAGCTAATTTTTGTTTTTTTGGTAGAGACGGGGTTTCACCATATTGG CCAGGCTGGTCTCCAACTCCTAATCTCAGGTGATCTACCCACCTTGGCCTCC CAAATTGCTGGGATTACAGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTG ATTTTGTAGGTAACCACGTGCGGACCGAGCGGCCGC 86 GCGGCCGCACGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACT Sequence of TTACCATAAGTAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCAT mAAV-JZif1 ATATCAGTGATATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTAC that is AAGGTGGACTGGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATAT incorporated AAAGATGAGTTTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAG into viral AGTGCCGCGTCCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGAC particle and CAGCTAAGCCTCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCA delivered to GGTCTAGCCAGTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGG humans or ACACACATAGTGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAA mice CCTGGGGCCAGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCT TTAGGAGACCCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTG CTCAGGCTTTGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCT CCCCGGCGCTCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAG CTGCGCGCCCTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGG TCGACGTGGCTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAG AAATGGAGTTCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTC CTGAGACTCAGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCAC ACGACTCCCTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAA AGAACCCGAAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCC CGAGCGCCCAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGAC AGGTGCGGTTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGC GGTGACCCTCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTC AGGAGGGGCAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCG TTACCTGGGACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCC GGGCGGGGGCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCC CAACACCCAAATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGC GCGGAGGGAATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCG GCCACCGCAGCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCG ACCAGGGCCCGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGG CAGGAGTTGGGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTG AAGGACTCCGGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTC CGCGGATTCGAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCC GTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAA TGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTA ATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCA TTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTG CATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGC TAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATAC CTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGC CCATCACTTTGGCAAAGAATTGGGATTCGAACATCGATGGTACCGAATTCAA TGGCCGCTGCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAGC GACCCCTTCGGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAAG CTGGAAGAGATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGCC GCTGGCGCCCCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCGG CGGAGGCGGAGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAGC ACATTCAATCCACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGACC GCCGAGAGCTTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGAA ACCAGCTACCCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGGC CGGTTCAGCCTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCGA GCCCCTGTTTAGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTGC CAGCAGCTCCTCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCCA GAGCCCTCCACTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCTA CAGCGCCGCTCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGCC TCAGAGCCAGGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCACC TCCTGCCTATCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCGA CTACCTGTTCCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGACC AGAAGCCTTTCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGACC CCCCTGAGCACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACCT GAAGGCCCTGAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGAT GCGGAAGTACCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCTT ACGCCTGCCCCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACGAG CTGACCCGGCACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCGG ATCTGCATGCGGAACTTCAGCAGCCGGGACGTGCTGCGGCGGCACAATAGA ACCCACACAGGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAGTTC GCCAGCAGAGATGTGCTGCGGAGACACAACAGGATCCACCTGAGACAGAAG GACAAGAAAGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAGCCT GAGCAGCTACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGACCAC AAGCTACCCATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTACCAG CTTCAGCTCTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGGCTT CCCTAGCCCTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTTCCC AGCTCAAGTGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAGCGC CAGCACCGGCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCATCG AGATCTGCTGACTCGAGAGATCTACGGGTGGCATCCCTGTGACCCCTCCCC AGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGT CCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTATAATAT TATGGGGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAAGACAA CCTGTAGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGTGGCA CAATCTTGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGC CTCAGCCTCCCGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTCAGCT AATTTTTGTTTTTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCTGGT CTCCAACTCCTAATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTGCTG GGATTACAGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGTAGG TAACCACGTGCGGACCGAGCGGCCGC 87 GCGGCCGCACGCGTCACCAACTGGGTAACCTCTGCTGACCCCCACTCTACT Sequence of TTACCATAAGTAGCTCCAAATCCTTCTAGAAAATCTGAAAGGCATAGCCCCAT mAAV-JZif2 ATATCAGTGATATAAATAGAACCTGCAGCAGGCTCTGGTAAATGATGACTAC that is AAGGTGGACTGGGAGGCAGCCCGGCCTTGGCAGGCATCATCCTCTAAATAT incorporated AAAGATGAGTTTGTTCAGCCTTTGCAGAAGGAAAAACTGCCACCCATCCTAG into viral AGTGCCGCGTCCTTGTCCCCCCACCCCCTCCAATTTATTGGGAGGAAGGAC particle and CAGCTAAGCCTCATCTAGGAAGAGCCCCTCACCCATCTCCACCTCCACTCCA delivered to GGTCTAGCCAGTCCTGGGTTGTGACCCTTGTCTTTCAGCCCCAGGAGAGGG humans or ACACACATAGTGCCACCAAAGAGGCTGGGGGAGGGCCTCAGCCCACCAAAA mice CCTGGGGCCAGTGCGTCCTACAGGAGGGGAACCCTCACCCCTTCAATCCCT TTAGGAGACCCAAGGGCGCTGCGCGTCCCTGAGGCGGACAGCTCCGTGTG CTCAGGCTTTGCGCCTGACAGGCCTATCCCCGGGAGCCCCCGCGCCTCCT CCCCGGCGCTCCGCCCTCGCCTCCCCCCGCCAGTTGTCTATCCTGCGACAG CTGCGCGCCCTCCGGCCGCCGGTGGCCCTCTGTGCGGTGGGGGAAGGGG TCGACGTGGCTCAGCTTTTTGGATTCAGGGAGCTCGGGGGTGGGAAGAGAG AAATGGAGTTCCAGGGGCGTAAAGGAGAGGGAGTTCGCCTTCCTTCCCTTC CTGAGACTCAGGAGTGACTGCTTCTCCAATCCTCCCAAGCCCACCACTCCAC ACGACTCCCTCTTCCCGGTAGTCGCAAGTGGGAGTTTGGGGATCTGAGCAA AGAACCCGAAGAGGAGTTGAAATATTGGAAGTCAGCAGTCAGGCACCTTCC CGAGCGCCCAGGGCGCTCAGAGTGGACATGGTTGGGGAGGCCTTTGGGAC AGGTGCGGTTCCCGGAGCGCAGGCGCACACATGCACCCACCGGCGAACGC GGTGACCCTCGCCCCACCCCATCCCCTCCGGCGGGCAACTGGGTCGGGTC AGGAGGGGCAAACCCGCTAGGGAGACACTCCATATACGGCCCGGCCCGCG TTACCTGGGACCGGGCCAACCCGCTCCTTCTTTGGTCAACGCAGGGGACCC GGGCGGGGGCCCAGGCCGCGAACCGGCCGAGGGAGGGGGCTCTAGTGCC CAACACCCAAATATGGCTCGAGAAGGGCAGCGACATTCCTGCGGGGTGGC GCGGAGGGAATGCCCGCGGGCTATATAAAACCTGAGCAGAGGGACAAGCG GCCACCGCAGCGGACAGCGCCAAGTGAAGCCTCGCTTCCCCTCCGCGGCG ACCAGGGCCCGAGCCGAGAGTAGCAGTTGTAGCTACCCGCCCAGGTAGGG CAGGAGTTGGGAGGGGACAGGGGGACAGGGCACTACCGAGGGGAACCTG AAGGACTCCGGGGCAGAACCCAGTCGGTTCACCTGGTAAGCTTGCTAGCTC CGCGGATTCGAATCCCGGCCGGGAACGGTGCATTGGAACGCGGATTCCCC GTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACAAAAAA TGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTTTCCCTA ATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGCACCA TTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTTCTG CATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATTGC TAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATAC CTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGC CCATCACTTTGGCAAAGAATTGGGATTCGAACATCGATGGTACCGAATTCAA TGGCCGCTGCCAAGGCCGAGATGCAGCTGATGAGCCCCCTGCAGATCAGC GACCCCTTCGGCAGCTTCCCCCACAGCCCCACCATGGACAACTACCCCAAG CTGGAAGAGATGATGCTGCTGAGCAATGGCGCTCCTCAGTTCCTGGGAGCC GCTGGCGCCCCTGAGGGCAGCGGCAGCAATAGCAGCAGCAGCTCTAGCGG CGGAGGCGGAGGGGGAGGCGGCGGAAGCAATAGCTCCAGCTCCAGCAGC ACATTCAATCCACAAGCCGACACCGGCGAGCAGCCCTATGAGCACCTGACC GCCGAGAGCTTCCCCGACATCAGCCTGAACAACGAGAAGGTGCTGGTGGAA ACCAGCTACCCCAGCCAGACCACCCGGCTGCCCCCTATCACCTACACAGGC CGGTTCAGCCTGGAACCCGCCCCTAACAGCGGCAACACCCTGTGGCCCGA GCCCCTGTTTAGCCTGGTGTCCGGCCTGGTGTCTATGACCAACCCCCCTGC CAGCAGCTCCTCTGCCCCAAGCCCTGCCGCCAGCTCTGCCTCTGCCAGCCA GAGCCCTCCACTGAGCTGCGCCGTGCCCAGCAACGACAGCAGCCCCATCTA CAGCGCCGCTCCCACCTTCCCCACCCCCAACACCGACATCTTCCCCGAGCC TCAGAGCCAGGCCTTTCCTGGATCTGCCGGCACCGCCCTGCAGTACCCACC TCCTGCCTATCCTGCCGCCAAGGGCGGCTTCCAGGTGCCCATGATCCCCGA CTACCTGTTCCCCCAGCAGCAGGGCGATCTGGGCCTGGGCACCCCCGACC AGAAGCCTTTCCAGGGCCTCGAAAGCCGGACCCAGCAGCCAAGCCTGACC CCCCTGAGCACCATCAAGGCCTTCGCCACCCAGAGCGGCAGCCAGGACCT GAAGGCCCTGAACACCAGCTACCAGAGCCAGCTGATCAAGCCCAGCCGGAT GCGGAAGTACCCCAACCGGCCCAGCAAGACCCCCCCACACGAGAGGCCTT ACGCCTGCCCCGTGGAAAGCTGCGACAGACGGTTCAGCAGAAGCGACAAC CTGGTCCGGCACATCCGGATCCACACCGGCCAGAAACCCTTCCAGTGCCGG ATCTGCATGCGGAACTTCTCTCGGAGCGACCACCTGACCACCCACATCAGA ACCCACACAGGCGAGAAGCCCTTCGCCTGCGACATCTGCGGCCGGAAGTTC GCCGACCCCGGCCACCTCGTCAGACACAACAGGATTCACCTGAGACAGAAG GACAAGAAAGCCGACAAGAGCGTGGTCGCCAGCAGCGCTACCAGCAGCCT GAGCAGCTACCCTTCTCCTGTGGCCACCTCCTACCCAAGCCCAGTGACCAC AAGCTACCCATCCCCCGCCACCACCTCTTATCCCAGCCCCGTGCCTACCAG CTTCAGCTCTCCCGGCAGCTCCACATACCCCAGCCCTGTGCATAGCGGCTT CCCTAGCCCTAGCGTGGCCACCACATACAGCAGCGTGCCCCCTGCCTTCCC AGCTCAAGTGTCCAGCTTCCCCAGCTCCGCCGTGACCAACAGCTTCAGCGC CAGCACCGGCCTGAGCGACATGACCGCCACCTTCAGCCCCCGGACCATCG AGATCTGCTGACTCGAGAGATCTACGGGTGGCATCCCTGTGACCCCTCCCC AGTGCCTCTCCTGGCCCTGGAAGTTGCCACTCCAGTGCCCACCAGCCTTGT CCTAATAAAATTAAGTTGCATCATTTTGTCTGACTAGGTGTCCTTCTATAATAT TATGGGGTGGAGGGGGGTGGTATGGAGCAAGGGGCAAGTTGGGAAGACAA CCTGTAGGGCCTGCGGGGTCTATTGGGAACCAAGCTGGAGTGCAGTGGCA CAATCTTGGCTCACTGCAATCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGC CTCAGCCTCCCGAGTTGTTGGGATTCCAGGCATGCATGACCAGGCTCAGCT AATTTTTGTTTTTTTGGTAGAGACGGGGTTTCACCATATTGGCCAGGCTGGT CTCCAACTCCTAATCTCAGGTGATCTACCCACCTTGGCCTCCCAAATTGCTG GGATTACAGGCGTGAACCACTGCTCCCTTCCCTGTCCTTCTGATTTTGTAGG TAACCACGTGCGGACCGAGCGGCCGC

OTHER EMBODIMENTS

Various modifications and variations of the described compositions and methods of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the art are intended to be within the scope of the invention. All publications, patents, and patent applications mentioned in the above specification are hereby incorporated by reference. 

What is claimed is:
 1. A recombinant adeno-associated vector (AAV) for expression of a gene in skeletal or cardiac muscle tissue comprising a muscle-specific promoter and a gene encoding a fusion protein, wherein said fusion protein comprises: a) a transcriptional activation element and b) a DNA binding element, wherein said DNA binding element binds to a promoter of the utrophin gene, wherein said utrophin promoter is a mouse or human utrophin “A” promoter, wherein said fusion protein, when expressed in said skeletal or cardiac muscle tissue, is capable of increasing utrophin expression, increasing muscle contractile force, or increasing muscle endurance, and wherein said vector comprises the sequence of SEQ ID NO:83, 84, or
 85. 2. The vector of claim 1, wherein said muscle-specific promoter is alpha-actin.
 3. The vector of claim 1, wherein said vector further comprises at least one element selected from a group consisting of an inverted terminal repeat, a cap signal, a multicloning site, an intron splice-donor site, an intron splice-acceptor site, an epitope tag, a nuclear localization sequence, and a polyadenylation consensus sequence.
 4. The vector of claim 1, wherein said mouse or human utrophin “A” promoter comprises the sequence of SEQ ID NO:9 or SEQ ID NO:14.
 5. The vector of claim 1, wherein said human utrophin “A” promoter comprises the sequence of SEQ ID NO:10 or SEQ ID NO:12 or said mouse utrophin “A” promoter comprises the sequence of SEQ ID NO:11 or SEQ ID NO:
 13. 6. The vector of claim 1, wherein said muscle-specific promoter is constitutively expressable throughout differentiation.
 7. The vector of claim 6, wherein the muscle-specific promoter is selected from the group consisting of alpha-actin, cardiac troponin C, myosin light chain 2 A, skeletal beta-actin, CK 6, dystrophin, muscular creatine kinase, dMCK, tMCK, enh348MCK, synthetic C5-12 (Syn), Myf5, MLC1/3f. MyoD1, Myog, and Pax7.
 8. The vector of claim 1, wherein said vector has a serotype selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, and AAV9.
 9. A composition comprising the vector of claim 1 and a pharmaceutically acceptable carrier. 